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This  dissertation  addresses  the  problem  of  detecting  known, 
discrete-time  signals  in  additive  non-Gaussian  noise.  The  case  of 
statistically  independent  samples  is  emphasized.  After  a  brief 
introduction  to  the  detection  problem,  the  characteristics  and 
sources  of  impulsive  noise  are  discussed.  Several  models  for 
impulsive  noise  are  then  presented.  The  complexity  of  these 
models  and  the  need  for  simple  density  functions  to  approximate 
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(Abstract  con't.) 

the  first  order  characteristics  of  impulsive  noise  justify 
consideration  of  three  systems  of  densities.  These  three 
systems  are:  a  generalized  Gaussian  noise,  the  Johnson  Su 

System,  and  a  mixture  model.  These  are  used  throughout  this 
dissertation  to  provide  examples. 

In  many  detection  problems  it  may  only  be  possible  to 
define  a  class  of  probability  densities  which  contains  the 
actual  noise  density.  In  such  cases  minimax  detectors  may  be 
used  to  guarantee  a  lower  bound  on  detector  performance  for  the 
entire  class.  The  minimax  detector  is  the  optimum  detector  for 
the  worst  case  density.  It  is  shown  that  the  worst  case  density, 
in  terms  of  minimizing  the  asymptotic  probability  of  detecting 
a  signal,  is  that  density  which  minimizes  Fisher's  Information 
over  the  entire  class.  Several  classes  of  densities  are  con¬ 
sidered  and  conditions  are  established  for  the  minimax  detectors. 
It  is  also  shown  that  impulsive  noise  with  a  large  value  for 
Fisher's  Information  provides  a  better  environment  for  detection 
than  does  a  Gaussian  noise  of  equal  power.  The  increased 
structure  of  the  impulsive  noise  may  be  exploited  to  increase 
detector  performance  over  that  possible  for  Gaussian  noise. 
Nonlinear  processing,  however,  is  required. 

An  adaptive  detector  which  uses  the  Johnson  System  as  a 
noise  model  is  then  presented.  After  a  study  of  different 
measures  of  tail  behavior,  a  scheme  to  adapt  a  detector 
utilizing  quantiles  to  measure  the  tail  behavior  of  the  first 
order  density  function  of  the  noise  is  developed.  Simulation 
results  are  presented  which  indicate  that  this  detector  will 
perform  well  over  a  large  variety  of  background  noises.  These 
simulations  also  support  the  statement  that  non-Gaussian  noise, 
with  large  value  for  Fisher's  Information,  provides  a  better 
environment  for  detection  than  does  Gaussian  noise. 

Suggestions  for  continuing  this  research  are  also 
included. 
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ABSTRACT 


This  dissertation  addresses  the  problem  of  detecting  known,  discrete- 
time  signals  in  additive  non-Gauss ian  noise.  The  case  of  statistically 
independent  samples  is  emphasized.  After  a  brief  introduction  to  the  de¬ 
tection  problem,  the  characteristics  and  sources  of  impulsive  noise  are 


discussed.  Several  models  for  impulsive  noise  are  then  presented.  The 


complexity  of  these  models  and  the  need  for  simple  density  functions  to 
approximate  the  first  order  characteristics  of  impulsive  noise  justify 
consideration  of  three  systems  of  densities.  These  three  systems  are: 


a  generalized  Gaussian  noise,  the  Johnson  5^  System,  and  a  mixture  model. 
These  are  used  throughout  this  dissertation  to  provide  examples. 

In  many  detection  problems  it  may  only  be  possible  to  define  a  class 
of  probability  densities  which  contains  the  actual  noise  density.  In  such 
cases  minimax  detectors  may  be  used  to  guarantee  a  lower  bound  on  detector 
performance  for  the  entire  class.  The  minimax  detector  is  the  optimum 
detector  for  the  worst  case  density.  It  is  shown  that  the  worst  case 
density,  in  terms  of  minimizing  the  asymptotic  probability  of  detecting 
a  signal,  is  that  density  which  minimizes  Fisher's  Information  over  the 


entire  class.  Several  classes  of  densities  are  considered  and  conditions 


are  established  for  the  minimax  detectors.  It  is  also  shown  that  impulsive 


noise  with  a  large  value  for  Fisher's  Information  provides  a  better 
environment  for  detection  than  does  a  Gaussian  noise  of  equal  power.  The 


increased  structure  of  the  impulsive  noise  may  be  exploited  to  increase 
detector  performance  over  that  possible  for  Gaussian  noise.  Non-linear 
processing,  however,  is  required. 

An  adaptive  detector  which  uses  the  Johnson  System  as  a  noise 
model  is  then  presented.  After  a  study  of  different  measures  of  tail 
behavior,  a  scheme  to  adapt  a  detector  utilizing  quantiles  to  measure 
the  tail  behavior  of  the  first  order  density  function  of  the  noise  is 
developed.  Simulation  results  are  presented  which  indicate  that  this 
detector  will  perform  well  over  a  large  variety  of  background  noises. 
These  simulations  also  support  the  statement  that  non-Gaussian  noise, 
with  large  values  for  Fisher's  Information,  provides  a  better  environ¬ 
ment  for  detection  than  does  Gaussian  noise. 

Suggestions  for  continuing  this  research  are  also  included. 
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INTRODUCTION 


This  dissertation  addresses  the  problem  of  detecting  known, 
discrete-time  signals  in  additive  non-Gaussian  noise.  This 
problem  may  arise  in  communications,  radar,  sonar,  and  other 
situations  where  the  dominant  noise  may  be  impulsive  in  nature. 
For  example,  low  frequency  radio  channels  are  plagued  by  highly 
impulsive  lightning  strokes  [1].  Higher  frequencies  must  contend 
with  unintentionally  generated  man-made  noise  .122  •  Among  the 
sources  of  impulsive  noise  on  the  sonar  channel  [3]  are  ice 
cracking  and  snapping  shrimp.  A  more  detailed  presentation  of 
the  sources  and  characteristics  of  impulsive  noise  is  given  in 
Chapter  II. 

Throughout  this  dissertation  the  noise  samples  will  be 
considered  independent.  Although  this  assumption  is  not  in 
general  valid  [4],  it  allows  n**1  order  noise  density  functions 
to  be  expressed  as  products  of  first  order  densities  thus  greatly 
simplifying  the  detection  problem.  For  many  of  the  results  of 
this  dissertation,  the  independence  assumption  can  be  relaxed 
using  the  weaker  condition  of  strong  mixing  (see,  for  example. 
Chapter  III  -  Section  1). 


The  detection  problem,  with  which  this  dissertation  is 
concerned,  is  perhaps  best  expressed  as  a  hypothesis  testing 
problem : 


H:  *  n^ 

K:  Xi  «  ni  +  6si 


0  >  0,  i=l,  . . .  ,N 


H  is  the  hypothesis  that  the  observations  {x^  consist  of 

noise  only.  The  alternative  K  is  that  in  addition  to  the  noise, 

a  known  signal  {Ss^^  is  present.  The  {n^  are  the 

realizations  of  the  random  variables  which  may  be  formed 

from  sampling  an  underlying  cortinuous-time  random  noise  process. 

N 

Throughout  this  dissertation  the  {iT  are  assumed  to  be  indepen- 

N 

dent  random  variables  with  densities  where  f .  e  F  some 

class  of  symmetric  densities.  The  s^  are  assumed  to  have  their 
absolute  values  upper  and  lower  bounded  by  positive  constants 
and  so  that  0  <  m1  <  ls^  l  <  <  •. 

The  optimum  detector,  in  terms  of  maximizing  the  probability 
of  detection  while  keeping  the  probability  of  false  alarm  below  a 
certain  level,  is  given  by  the  Neyman-Pearson  lemma  [5].  This 
detector  compares  the  likelihood  ratio  to  some  threshold  which  is 
chosen  to  achieve  the  desired  probability  of  false  alarm. 

fj< (x)  (  >  Ti  decide  K 


L{x) 


<  T, 


decide  H 


f^(x)  an^  )  are  probability  density  functions  for 

the  observation  vector  X  under  K  and  H  respectively.  Since  the 
noise  samples  are  assumed  independent  and  the  signal  is  known,  the 
fK(x)  and  fy(jc)  can  be  written  as  products  of  the  univariate 
noise  densities: 


N 

■  n  fi(*i  -  esi ) 
fH(x)  =. n  f i (xi ) 

—  i*i 

The  optimum  detector  can  then  be  written  as 


( 


II  f  i  (  X:  0S:  ) 

L(x)  =  i=l  1  1  1 

N 

n  f i (xj ) 
i  =  1 

In  order  to  simplify  the  detector,  it  is  common  practice  to  take 
the  natural  logarithm  of  Eq.  (1.4).  This  yields  the 
Neyman-Pearson  optimum  detector: 


/  >  T1  decide  K 
^  <  Ti  decide  H 


N 


r« 


^opt  ^ * ) 


N  £i(*i  -  0s: ) 

£  log - 

i=l  f i ( xi ) 


>  T, 


<  T, 


decide  K 
decide  H 


C 


Figure  1.1  displays  a  block  diagram  for  this  detector.  The 
structure  is  that  of  a  time  dependent  nonlinearity 


0s  j ) 


9i  (*i )  *  9opt.  <Xi)  *  log 


r  i  ( x  i 


(3 


5- 


followed  by  a  summer  and  a  threshold  comparator.  Unfortunately, 
this  nonlinearity  is  often  difficult  to  implement. 

In  small  signal  cases  a  simpler  detector,  the  locally 
optimum  detector,  is  often  used.  Unlike  the  Neyman-Pearson 
optimum  detector,  which  maximizes  the  probability  of  detection 
(from  here  on  referred  to  as  the  power  of  the  detector)  for  a 
fixed  probability  of  false  alarm  (referred  to  as  the  level),  the 
locally  optimum  detector  maximizes  the  slope  of  the  power 
function  (power  versus  signal  strength)  at  the  origin. 

Since  the  slope  of  the  power  function  corresponds  to  the  increase 
in  power  as  signal  strength  is  increased,  the  locally  optimum 
detector's  performance  should  be  nearly  as  good  as  that  of  the 
optimum  detector  for  small  signals. 

The  locally  optimum  likelihood  ratio  [6]  is 


L(x) 


ae  fK<x) 

fH(x) 


e-o 


Since  fK(x)  approaches  fj^x)  as 

L(x)  =  1_  log  f k { x ) 
-  30  ~ 


8-0, 

8-0 


L(x)  can  be  written 


Substituting  in  (1.3),  taking  the  derivative,  and  allowing  8  to 
approach  zero,  yields  the  standard  form  for  the  locally  optimum 
detector. 


N  ti'(xi) 

Tl0c(x)  =  l  - - - —  (-S^) 

1=1  fi(Xj) 


>  T„ 


<  T, 


decide  K 
decide  H 


-6- 


This  detector  has  the  same  structure  displayed  in  Figure  1.1  with 


fi ‘ (Xi  ) 

9i<xi>=  9loc:(xi>  =  -  (-s^ 

1  fi(»i) 


(1.10) 


The  time  dependent  nonlinearities  g0pt.(*)  and  glDc .  (•)  are 
related  [7]  by  the  equation 


9opt^  ^ xi ^ 


kf 


gloc. (t)dt 

i 


xi“0si 


(1.11) 


and  the  fact  that  gioc^tO  is  the  linear  term  in  the  Taylor 
expansion  of  gopt^*)5 


9opt^xi)  =  6gioc^(xi)  +  {higher  order  terms} 


(1.12) 


For  small  values  of  0/  gopt^*)  is  often  approximated  by 
®9ioCi  (  * )  • 

In  addition  to  the  power  function,  another  useful  measure  of 
detector  performance  is  provided  by  the  concepts  of  efficacy  and 
asymptotic  relative  efficiency  [6j.  Asymptotic  relative 
efficiency  (abbreviated  ARE)  is  used  to  compare  two  detectors 
based  on  the  asymptotic  ratio  of  the  number  of  samples  required 
by  each  to  achieve  the  same  level  and  power.  That  is,  if 
n^(a,g,0)  is  the  number  of  samples  required  by  Detector  1 
to  achieve  level  a  and  power  g  with  signal  strength  0  and 
^(a.g,©)  is  the  corresponding  number  for  Detector  2,  then 


n2(a,  B,  6) 

AR£i,2  *  lira  _ _ 

B+Q  n1(a,  B,e) 


(1.13) 


Thus,  if  Detector  2  requires  (in  the  limit)  twice  as  many  samples 
as  Detector  1  (n2  *  2n^),  then  ARE^  2  *  ^  an<3  one  cou^  saY 
Detector  1  is  "twice  as  efficient"  as  Detector  2.  Often  the 
linear  detector  is  used  as  a  reference  detector  for  comparing 
nonlinear  detectors  using  ARE.  ARE  is  by  definition  a  small 
signal  (and  consequently  a  large  sample  size)  concept.  It  can  be 
shown  [6]  that  the  nonlinear  detector  which  has  the  maximum  value 
of  ARE  (using  some  reference  such  as  the  linear  detector)  is  the 
locally  optimum  detector. 

Although  Eq.  (1.13)  provides  the  definition  for  ARE,  the 
concept  of  efficacy  provides  a  much  simpler  way  to  calculate  it. 

If  is  a  threshold  detector  for  the  problem  expressed  by  Eq.  (1.1) 
using  the  test  statistic  T(x), 

T(x)  /  >  T  decide  K  (1.14) 

(  <  T  decide  H 

then  the  efficacy  [8]  of  D^  is  given  by 

3  I  I2 

38  ekCt(x) 3  e-*-o 

_ ±  (1.15) 

NVarH[T(x)] 


ed, 


lim 
N  •*■<*> 


where  EK[T(x)]  and  VarH[T(x) ]  are  the  expectation  under  K  and 
the  variance  under  H  of  the  test  statistic.  The  ARE  can  then  be 
calculated  [8]  from 


-  -  * 


A 


-8- 


1 

1 

I 

i 

I 

■ 

»  . 
i" 

n 


V 


U 

f 


i 


AREp  D  « 

11  2  eD2 

Another  definition  which  will  be  used  frequently  is  Fisher's 
measure  of  information  [53  for  the  location  parameter  in  a 
density.  For  the  problem  stated  in  equation  (1.1),  Fisher’s 
Information  may  be  expressed  as 

<S> 

I(£)  "/  (tot1)2  tu,dx 

—  CD 


A  brief  outline  of  the  rest  of  this  dissertation  will  now  be 
presented.  Chapter  II  focuses  on  impulsive  noise.  First,  the 
physical  sources  and  properties  of  man-made  and  naturally 
occurring  impulsive  noise  are  discussed.  Several  models  for 
impulsive  noise  are  then  presented.  Well  known  empirical  models 
(mathematical  constructs  designed  to  fit  the  data)  and  physical 
models  (which  are  directly  related  to  the  underlying  physical 
mechanisms)  are  described.  Due  to  the  complexity  of  these  models 
and  the  need  for  simple  density  functions  to  approximate  the 
first  order  characteristics  of  impulsive  noise,  three  density 
systems  are  considered.  They  are:  a  generalized  Gaussian  noise, 
the  Johnson  Su  System,  and  a  mixture  model.  Graphs  of  the 
density  functions,  their  optimum  and  locally  optimum  non- 
linearities,  and  values  for  Fisher's  Information  are  given  for  a 
range  of  values  for  the  parameters  of  these  systems.  The  chapter 
ends  with  a  discussion  of  techniques  which  may  be  used  for  the 
detection  of  signals  in  an  impulsive  environment. 
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Frequently  in  detection  problems#  the  statistical 
characteristics  of  the  noise  are  not  well  established.  In  many 
cases,  mini-max  detectors  may  be  used  to  achieve  robustness. 
Chapter  III  presents  results  which  facilitate  the  design  of  these 
detectors.  First,  the  asymptotic  performance  (measured  by  the 
level  and  power  of  the  test)  of  optimum  detectors  is  related  to 
the  value  of  Fisher's  Information  for  the  underlying  noise 
distribution.  The  density  with  minimum  Fisher's  Information  over 
a  given  class  of  densities  is  seen  to  offer  the  worst  environment 
in  which  to  detect  signals.  This  density,  along  with  its  locally 
optimum  detector,  is  shown  to  form  a  saddlepoint.  Several 
classes  of  densities  are  discussed,  and  conditions  are 
established  for  the  mini-max  detectors. 

Chapter  IV  presents  an  adaptive  detection  scheme.  The 
Johnson  Sy  system  of  densities  is  used  as  a  noise  model. 

Quantiles  are  used  to  estimate  the  tail  behavior  of  the  first 
order  density  function  of  the  noise.  This  estimate,  along  with 
an  estimate  of  the  variance,  determines  the  detector’s 
structure.  Simulation  results  are  presented  which  indicate  that 
this  detector  will  perform  well  over  a  large  variety  of 
background  noises. 

This  dissertation  ends  with  a  brief  summary  and  some 
suggestions  for  future  research.  An  attempt  has  been  made  to 
keep  all  symbols  used  consistent  throughout  the  text.  Those 
symbols  which  were  not  presented  in  this  introduction  are 
presented  as  needed  in  the  chapters.  Chapters  II,  III,  and  IV 
are  independent  and  may  be  read  in  any  order. 
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Chapter  II 


IMPULSIVE  NOISE:  CHARACTERISTICS,  SOURCES,  AND  MODELS 

This  chapter  serves  as  an  introduction  to  impulsive 
noise.  Section  1  contains  a  brief  discussion  of  the 
characteristics  and  sources  of  impulsive  noise.  Section  2 
presents  several  of  the  more  common  impulsive  noise  models. 
Empirical  models  designed  to  fit  first  order  statistical  data, 
and  physical  models  directly  related  to  the  underlying  physical 
mechanisms,  are  described.  Most  of  these  models  are  quite 
complex  and  do  not  lend  themselves  easily  to  detection 
problems.  For  this  reason,  Section  3  presents  three  density 
function  systems  which  can  be  used  to  approximate  the  first 
order  characteristics  of  impulsive  noise.  Section  4  ends  the 
chapter  with  a  discussion  of  techniques  which  may  be  used  for 
the  detection  of  signals  in  an  impulsive  environment. 

Section  1:  Characteristics  and  Sources 

Impulsive  noise  can  be  characterized  by  a  relatively  small 


number  of  random,  high  amplitude  bursts.  Unlike  either  thermal 
noise  or  high  density  shot  noise,  each  of  which  consists  of  a 
large  number  of  sources  contributing  small  disturbances  and  thus 
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satisfies  the  conditions  of  the  Central  Limit  Theorem, 
impulsive  noise  typically  consists  of  relatively  few  bursty 
sources.  The  Central  Limit  Theorem  is  not  applicable,  and  the 
noise  is  distinctly  non-Gaussian.  The  first  order  density 
functions  usually  have  much  heavier  tails  than  that  of  the 
Gaussian  distribution.  Impulsive  noise  can  rarely,  if  ever,  be 
considered  white.  Samples  are  not  independent.  However,  due 
to  the  complexities  of  studying  n**1  order  non-Gaussian 
distributions,  most  research  has  centered  on  describing  the 
first  order  characteristics.  The  independence  assumption  is 
then  necessary  to  describe  high  order  behavior. 

Natural  Noise  Souces 


Impulsive  noise  sources  can  be  grouped  into  two 
categories:  naturally  occurring  and  man-made.  Perhaps  the 

naturally  occurring  impulsive  noise  which  has  captured  the  most 
attention  in  the  literature  £1,2,3, 4]  is  atmospheric  noise.  It 
adversely  affects  communications  systems  from  1  kHz  to  30  MHz. 
The  total  atmospheric  noise  measured  at  a  given  location  can 
usually  be  treated  as  the  sum  of  many  individual  lightning 
discharges  modified  by  the  appropriate  propagation  path.  Due 
to  the  fact  that  at  very  low  frequencies  signals  can  propagate 
several  thousands  of  miles,  even  extremely  distant  storms  can 
raise  havoc  with  radio  communications.  The  following 
description  of  lightning  discharges  is  due  essentially  to  Watt 
and  Maxwell  £l]> 
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A  lightning  discharge  consists  of  both  a  predischarge  and 
a  main  discharge.  The  predischarge  consists  of  a  series  of 
short  leader  strokes,  30  to  200  feet  in  length,  which  attempt 
to  establish  an  ionized  path  from  the  cloud  to  the  ground. 

Each  leader  stroke  is  a  current  pulse  of  approximately  300 
amperes  and  lasts  about  one  microsecond.  A  new  stroke  occurs 
every  25  to  100  microseconds.  This  process  forms  what  is 
termed  the  predischarge  or  fine  structure  of  the  lightning 
stroke.  It  typically  lasts  about  one  millisecond.  According 
to  Hall  [4],  the  energy  radiated  by  the  predischarge  has  a  3  dB 
bandwidth  of  about  40  kHz  with  a  maximum  at  approximately  30 
kHz.  The  spectrum  falls  off  as  1/f. 

Once  a  leader  reaches  the  ground,  the  main  or  return  stroke 
follows  the  ionized  path  from  the  ground  to  the  cloud.  It 
consists  of  a  20,000  ampere  current  pulse  and  lasts  about  100 
to  200  microseconds.  The  return  stroke  accounts  for  about  95 
percent  of  the  total  energy  radiated  by  the  lightning 
discharge.  Often  the  main  discharge  is  followed  by  one  or  more 
additional  strokes  which  follow  the  same  ionized  path.  The 
main  discharge  radiates  power  at  a  much  lower  frequency  than 
the  predischarge.  The  center  frequency  is  about  10  kHz  and  the 
upper  3  dB  point  is  approximately  15  kHz  [43.  Beyond  this,  the 
spectrum  decreases  as  l/f2. 

The  majority  of  the  power  in  VL F  (3-30  kHz)  and  LF  (30-300  kHz) 
atmospheric  noise  comes  from  the  powerful  return  strokes.  However, 
since  the  return  strokes'  power  spectra  decrease  as  l/f2, 
predischarges  contribute  the  majority  of  the  atmospheric  noise  at 


MF  (300-3000  kHz)  and  HF  (3-30  MHz).  For  still  higher 
frequencies,  atmospheric  noise  caused  by  lightning  can  usually  be 
ignored . 

Due  to  the  many  leader  strokes  of  the  predischarge  and  the 
occurrence  of  multiple  return  strokes,  the  received  noise  pulses 
may  not  be  considered  independent.  However,  neglecting  the 
continuously  changing  effects  of  weather  and  propagation  paths, 
these  clusters  of  noise  pulses  may  usually  be  considered 
independent  of  one  another.  In  temperate  climates  a  low 
frequency  receiver  will  detect  an  average  of  one  stroke  every 
second  [4].  In  the  tropics  the  average  rate  may  be  as  high  as 
100  per  second  [4]. 

The  sonar  channel  is  also  affected  by  naturally  occurring 
impulsive  noise.  The  major  contributor  in  shallow  water  is 
marine  life.  Many  species  can  produce  underwater  sounds  which 
can  mask  desired  signals  [5,6],  Among  the  more  important  noise 
makers  are  snapping  shrimp  and  croakers.  Snapping  shrimp, 
usually  found  in  shallow  tropical  or  subtropical  water,  make  a 
crackling  sound  by  repeatedly  snapping  closed  their  claws. 
Croakers,  found  predominantly  in  bays  on  the  east  coast  of  the 
United  States,  produce  repeated  bursts  of  tapping  sounds.  In 
areas  with  a  large  croaker  population,  the  sound  can  resemble  a 
continuous  roar.  Porpoise  barks  and  gobbles,  sea  robin  squawks, 
and  toadfish  "hoops"  may  be  among  other  additions  to  the  chorus. 
These  bursty  sources  of  short  duration  sounds  can  yield  a 
constantly  changing,  highly  impulsive  noise  environment. 

In  polar  regions,  ice  cracking  and  floe  movement  are  the 
major  contributors  to  acoustic  impulsive  noise.  Milne  and  Ganton 
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[7]  have  measured  acoustic  noise,  including  amplitude 
distributions,  under  arctic  sea  ice.  When  the  ice  pack  is 
solid,  thermal  stresses  cause  surface  cracks  which  can  be  heard 
by  a  hydrophone  below.  At  low  frequencies  (below  1  kHz)  the 
noise  is  found  to  be  impulsive.  For  frequencies  above  1  kHz  a 
more  Gaussian  noise  has  been  recorded.  They  hypothesize  that 
the  low  frequency  noise  is  caused  by  medium  to  large  thermal 
cracks  in  the  surface  ice.  Higher  frequency  noise  appears  to 
be  caused  by  numerous  smaller  thermal  cracks  located  near 
irregularities  on  the  ice  surface.  During  the  summer  months 
many  ice  packs  break  up.  Acoustic  noise  then  results  from  the 
motion  of  the  ice  and  collisions  between  ice  floes.  Under  all 
but  a  relatively  small  range  of  frequencies,  the  summer  ice 
noise  is  found  to  be  non-Gaussian. 

Man-made  Noise  Sources 


Middleton  [8]  divides  man-made  impulsive  noise  sources  on 
the  radio  channel  into  two  categories:  narrowband,  for  sources 
whose  noise  spectra  are  narrower  than  the  bandpass  of  most 
receivers,  and  broadband.  Automobile  ignition  noise,  generally 
considered  to  be  the  most  important  VHF/UHF  noise  source  in 
metropolitan  areas  [9],  can  be  classified  as  broadband.  It 
consists  of  random,  very  narrow  noise  pulses.  Neon  and 
fluorescent  lights  also  contribute  broadband  noise  [8,9].  The 
random  impulses  are  less  intense  than  those  from  automobile 
ignitions  and  can  usually  be  considered  a  local  disturbance. 
Other  broadband  impulsive  radio  noise  sources  are  arc  welders, 
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oil  burner  ignitions  and  other  ignition  noises.  Narrowband 
noise  includes  emissions  from  high  voltage  transmission  lines 
and  generating  stations,  electrical  motors,  and  various  other 
electrical  devices.  Interference  from  other  communications  and 
radar  systems  can  also  be  considered  as  narrowband  impulsive 
noise. 

The  sonar  channel  is  also  affected  by  man-made  noise 
[5,6].  Ships  contribute  appreciable  acoustical  noise  in 
harbors  and  shipping  lanes.  Also,  industrial  plants  along  the 
coastline  can  be  heard  for  a  considerable  distance  offshore. 

Telephone  circuits  must  contend  with  switching  noise.  It 
is  often  modeled  as  an  impulsive  process  [10,11].  The  noise  is 
broadband.  Since  the  impulses  usually  occur  in  bursts,  errors 
in  digital  transmissions  are  often  bunched  together. 


Section  2;  Impulsive  Noise  Models 

Impulsive  noise  models  are  often  grouped  into  two 

categories:  empirical  models  and  physical  models.  Empirical 

models  are  mathematical  constructs  which  attempt  to  fit 

measured  statistical  data.  Little  or  no  attempt  is  made  to 

relate  the  model  to  the  physical  mechanism  of  the  process. 

Physical  models,  on  the  other  hand,  attempt  to  describe  the 

entire  noise  process.  Most  consider  the  received  noise  as  a 

sum  of  filtered  impulses.  Due  to  the  mathematical  difficulty 

t  h 

.ivolved  with  studying  n  order  distributions,  most  research 
has  concentrated  on  first  order  characteristics.  The  main 
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emphasis  in  the  literature  is  on  the  modeling  of  the 
probability  distribution  of  the  narrowband  noise  envelope. 
Most  work  has  centered  on  atmospheric  noise. 


Empirical  Models 


In  most  situations  a  received  noise  process,  after  having 
passed  through  the  bandpass  filter  of  the  receiver,  can  be 
considered  as  a  narrowband  signal 


f(t)  *  x(t)cos[uit  +  <fc(t)] 


(2.1) 


When  f(t)  is  a  Gaussian  process,  samples  of  the  envelope  x(t) 
are  random  variables  having  a  Rayleigh  distribution.  The 
Rayleigh  distribution  is  given,  in  standard  form,  by 


g (x )  *  £-2  e  2o2 

0 

,  x>0 

(2.2) 

=  0 

,  X^O 

The  mean  is  0.5\/2tT  o.  The  variance  equals  (2-ir/2)o2. 

It  has  been  found  [12]  that  the  Rayleigh  distribution  is  a 
close  approximation  to  the  low-amplitude,  high-probability 
portion  of  the  atmospheric  noise  envelope  distribution.  This 
is  because  the  low-amplitude  portion  of  the  distribution  is 
caused  by  the  superposition  of  many  overlapping  small  pulses 
which,  according  to  the  Central  Limit  Theorem,  tend  to  behave 
like  a  Gaussian  process.  However,  atmospheric  noise,  and 


impulsive  noise  in  general,  has  far  more  large  amplitude  spikes 
than  predicted  by  the  Rayleigh  distribution.  For  this  reason. 


the  log-normal  distribution  was  proposed  [13].  Its  density  in 
standard  form  is 


p  ( x ) 


_i_  1  e 

■fl  TI  C  X 


=  o 


--Ly  {log  x  -  y)2 
20  , x  >0 


,x$0 


+  a2/2 


2y  +  o: 


The  mean  is  e  .  The  variance  is  given  by  e‘H  ’  u  (eu  - 

Figure  2.1  displays  both  the  Rayleigh  and  log-normal 
distributions.  The  parameters  are  chosen  such  that  both 
densities  have  the  same  mean  (0.5^37  ~  1.253)  and  variance 
( 2-n/2 ~  0. 429) .  The  much  heavier  tail  of  the  log-normal 
distribution  can  be  seen  in  Fig.  2.2,  which  is  an  enlargement 
of  the  tail  area  of  Fig.  2.1.  Since  tail  behavior  is  very 
important  in  impulsive  noise  modeling,  a  more  accurate  method 
of  display  is  required. 

Impulsive  noise  data  is  often  plotted  on  Rayleigh  graph 
paper.  Rayleigh  coordinates  are  chosen  such  that  the  Rayleigh 
distribution  plots  as  a  straight  line.  For  the  cummulative 
distribution  function  F(x)  *  Prob(x  <  t),  one  would  plot 
201og (x/xrms )  on  the  vertical  axis  and  -log(-ln[l  -  F(x)])  on 
the  horizontal  axis.  Fig.  2.3  displays  both  the  Rayleigh  and 
log-normal  distributions  using  Rayleigh  coordinates.  The 
heavier  tail  of  the  log-normal  is  now  more  readily  apparent 
(left  hand  side  of  the  graph). 
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Fig.  2.3  -  Rayleigh  and  Log-Normal  Distributions 

Rayleigh  Coordinates 
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Fig  2.4  shows  atmospheric  noise  data  measured  at  Boulder, 
Colorado  [12].  The  shape  of  the  curve  is  typical  for  broadband 
impulsive  noise.  Neither  the  Rayleigh  nor  the  log-normal 
distribution  provides  an  acceptable  fit  over  the  entire  range 
of  values. 

A  graphical  technique  to  derive  the  envelope  probability 
distribution  function  for  atmospheric  noise,  given  measurements 
of  three  moments,  was  developed  at  the  National  Bureau  of 
Standards  [12].  These  three  moments:  the  average  noise  power, 
the  average  envelope  voltage,  and  the  average  logarithm  of  the 
envelope  voltage,  were  continuously  recorded  during  the 
International  Geophysical  Year  <1958)  at  sixteen  stations 
throughout  the  world.  The  graphical  method  was  developed  to 
utilize  this  information  to  accurately  model  atmospheric  noise. 

Analyzing  the  data,  it  was  noticed  that  the  atmospheric 
noise  envelope  probability  distribution,  when  plotted  using 
Rayleigh  coordinates,  could  be  approximated  by  a  three  section 
curve.  The  low-amplitude,  high-probability  portion  corresponds 
to  the  Rayleigh  line.  The  high-amplitude,  low-probability 
portion  can  be  approximated  by  a  straight  line  of  somewhat 
steeper  slope.  It  can  be  shown  [14]  that  all  functions  of  the 
form  F(x)  *  1  -  exp(-kxm)  (and  only  these  functions)  will  map 
on  Rayleigh  coordinates  as  straight  lines.  This  family  of 
curves,  which  includes  the  Rayleigh  distribution,  is  often 
called  the  power-Rayleigh  distribution.  The  middle  amplitude 
portion  of  the  curve  was  then  approximated  by  the  arc  of  a 
circle  tangent  to  the  two  lines.  The  center  of  this  circle 
lies  on  the  bisector  of  the  angle  formed  by  the  Rayleigh  and 
power-Rayleigh  lines. 


Four  parameters  are  needed  to  specify  this  approximation 


to  the  true  distribution: 

(1)  a  point  on  the  Rayleigh  line 

(2)  a  point  on  the  power-Rayleigh  line 

(3)  the  slope  of  the  power-Rayleigh  line 
and  (4)  the  radius  of  the  circle 

Empirical  relationships  were  developed  [12]  to  obtain  these 
four  parameters  from  the  three  measured  moments.  Thus,  by 
measuring  the  three  moments,  a  function  can  be  obtained  which 
is  a  fairly  accurate  approximation  to  the  envelope  probability 
distribution  function  of  the  noise.  A  method  was  later 
developed  [15]  to  obtain  the  distribution  function  for  any 
bandwidth  by  transforming  the  distribution  function  obtained 
for  the  bandwidth  of  the  measurements.  These  graphical 
methods,  although  fairly  accurate,  were  not  given  any 
theoretical  justification.  Even  more  importantly,  the 
procedures  are  of  limited  use  in  the  analytical  treatment  of 
optimum  detectors  and  estimators. 

Many  other  distributions  have  been  suggested  to  model 
impulsive  noise  (see  [8]  for  a  concise  list).  Most  are  limited 
in  application  and  not  entirely  successful.  None  of  the 
empirical  models  give  a  true  picture  of  the  entire  interference 
process . 

Physical  Models 

Most  physical  models  attempt  to  describe  the  entire 
interference  process  by  modeling  the  received  noise  as  a  sum  of 
filtered  impulses: 


The  are  random  variables  which  describe  the  pulse 

amplitudes,  p ( • )  describes  the  pulse  shape,  and  the  random 
variables  {t^  are  the  occurrence  times.  Different 
assumptions  for  these  variables  can  lead  to  quite  different 
models . 

Furutsu  and  Ishida  [16]  were  interested  in  modeling  the  HF 
radio  noise  which  is  predominantly  caused  by  the  fine  structure 
of  the  lightning  discharge.  They  viewed  each  predischarge  as  a 
noise  packet.  They  assumed  Poisson  occurrence  times  for  these 
packets.  Within  each  packet  they  modeled  the  fine  structure  as 
another  independent  Poisson  process.  This  model,  consisting  of 
Poisson  noise  packets  occurring  in  a  Poisson  manner,  is  often 
called  Poisson-Poisson  noise. 

In  order  to  study  the  narrowband  noise  envelope,  Furutsu 
and  Ishida  assume  the  following  filtered  impulse  model: 

N  N' 

x(t)  *  l  l  r(t,  t^,  t'jj,  a  j  j  )cos[u>(t-t  ’  *  j  )-4>j  j] 
i=l  3=1 

where  t  =  the  time  of  observation 

t^  «  time  of  occurrence  of  i**1  pulse  packet 
t'jj  *  time  of  occurrence  of  the  j**1  pulse  within  the  ith 
packet 

w  *  receiver  center  frequency 

^  y.  x.  v 

.  *  phase  of  the  j  pulse  within  the  i  packet 


They  assume  the  phase  is  uniformly  distributed  over  (0,2n). 

The  packet  occurrence  times  are  assumed  to  be  uniformly 
distributed  over  the  interval  of  observation  (0,T)  and  the 
random  pulses  are  assumed  uniformly  distributed  over  the  length 
of  each  packet.  The  probability  that  N'  pulses  occur  in  time 
(0,t),  within  a  packet,  is  given  by 

N*  ~v*  t 

The  probability  that  N  packets  occur  in  (0,T)  is  given  by 

N  “^T 

PT(N)  -  (uT)  e 

For  clarity,  let  t  *  t  -  t^  *  the  length  of  time  from  the 
t  h 

start  of  the  i  packet  to  the  time  of  the  observation,  a  =  ai , 
and  t'  *  t'ij*  Now  assume 

-B(T-t’)  -of 

r(t,ti ,t'i j.ai j)  «  r(i,t',a)  *  ae  e 

Here  e~“  is  the  receiver  impulse  response  and  ae  p 
is  the  amplitude  of  the  impulse  within  the  packet.  Note  that, 
although  'a'  is  a  random  variable,  the  impulses  will  tend  to  be 
larger  at  the  start  of  a  packet  and  then  will  tend  to  decrease 
in  magnitude.  Also,  'a'  is  assumed  distributed  according  to 
w(a),  some  function  which  is  chosen  considering  the  spatial 
distribution  and  strength  of  the  impulse  sources. 
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Using  the  above  assumptions,  Furutsu  and  Ishida  calculated 
the  first  order  density  function  for  the  noise  envelope.  Their 
results  are 


f  (E)  *  EX  J0[£X]  i>{  X)  dX  (2.9) 

Here  J  [•]  is  the  zero  order  Bessel  function  (1st  kind) 
o 

and  i[)(X)  is  the  characteristic  function. 


i^(X)  =  exp  { 


U’  (X)  -  1]  dt  } 


(2.10) 


where. 


^  w(a)  ( JcC Xr ( t, t ' ,a) ]  -1)  da  dt’}  (2.11) 

Upon  substitution  of  r(-r,t’,a)  and  w(a)  the  integrals  become 
quite  intractable.  Simplifying  assumptions  lead  to  solutions 
for  special  cases.  These  solutions  have  shown  fairly  good 
agreement  with  measurements.  However,  since  the  density 
functions  cannot  be  put  in  closed  form  in  general,  this  model 
is  of  only  limited  use  in  the  analytical  treatment  of  detection 
and  estimation  problems. 

Beckmann  [14]  proposed  a  different  model  for  atmospheric 
noise.  He  assumed  individual  atmospherics  could  be  represented 
by  pulses,  whose  envelopes  could  be  written  as 


1  (  X )  =  exp  {  v'  | 


-(t-tk) /a 

uk  =  Ek  e  ,  t^tk  (2.12) 

(t-tk) /b 

■  Ek  e  '  t<fck 

The  rate  of  decay  for  the  spikes  was  assumed  to  be  much  slower 
than  the  rate  of  upsurge  (b<<a).  For  convenience,  define 
c  =  (a+b)/2,  the  average  time  constant.  Beckmann  chose  the 
peak  amplitude  E^  to  be  log-normally  distributed.  He 
justified  this  as  follows.  Divide  the  propagation  path  for  a 

A 

given  atmospheric  into  a  large  number  of  sections.  Assume  the 
attenuation  for  each  section  is  independent  of  the  rest  and 
that  no  one  section's  attenuation  will  dominate.  Then  the 


total  attenuation  (expressed  in  dB)  will  be  normally 
distributed  (mean  p,  variance  o2).  Since  the  log  of  the 
peak  amplitude  ( E^ )  is  proportional  to  the  total  attenuation, 
the  peak  amplitude  is  log-normally  distributed. 

Using  Poisson  occurance  times  (with  v  *  the  mean  number 
of  pulses  per  unit  time)  and  considering  the  phase  to  be 
uniformly  distributed,  the  values  for  all  the  pulses,  both 
rising  and  decaying,  are  summed.  In  the  low  density  case 
(vc<<1),  the  nearest  pulse  will  dominate  the  others.  The 
rest  can  be  approximated  by  a  single  pulse  with  a  Rayleigh 
envelope.  Thus  the  received  noise  can  be  modeled  as  the  sum  of 
a  pulse  with  a  log-normally  distributed  envelope  and  a  pulse 
with  a  Rayleigh  envelope.  Beckmann  calculates  the  distribution 
function  of  the  total  noise  envelope  as 
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I  exp[-4.n  y  -  y)  2  _  _E 
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(2.13) 


2  2  2  (  2^  \ 

where  M  =  v  c  e  0  ^'ln(l/vc)  is  approximately  equal 
to  the  noise  power  and  IQ(  •)  is  the  0t^1  order  modified  Bessel 
function.  This  distribution  is  nearly  Rayleigh  for  small  E  ana 
nearly  log-normal  for  large  values  of  E.  The  four  parameters  of 
the  density  v,  c,  a  and  y  can  be  directly  related  to  the 
properties  of  the  noise.  Both  v,  the  mean  number  of  discharges 
per  unit  time,  and  c,  the  average  time  constant,  depend  only  on 
the  properties  of  the  discharges.  The  parameter  o,  the 
standard  deviation  of  the  total  attenuation  of  the  pulses,  and 
y»  the  mean  value  of  the  total  attenuation,  are  determined 
almost  entirely  by  the  properties  of  the  propagation  path.  Also, 
when  plotted  on  Rayleigh  coordinates,  vc  determines  the 
position  of  the  Rayleigh  line  (low-amplitude,  high-probability 
line),  o  determines  the  log-normal  curve  (high-amplitude, 
low-probability),  and  v,  c,  o  and  y  together  determine  the 
rms  value.  Thus,  knowing  these  four  parameters,  a  density  can  be 
sketched  by  connecting  these  two  functions  with  a  circular  arc  as 
was  done  by  Crichlow,  et  al.  [12]. 

Although  Beckmann's  model  is  closely  related  to  the 
statistical  parameters  of  the  noise  process,  the  integral  of 
Eq.  (2.13)  is  rather  cumbersome.  A  general  solution  can  only 
be  found  for  very  small  or  very  large  envelope  levels.  Hence, 
Beckmann  provides  a  theoretical  justification  for  the  use  of 
the  log-normal  distribution  for  strong  atmospherics,  but  he 
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doesn't  provide  a  density  function  which  can  represent  the 
noise  over  the  entire  range  of  interest.  Giordano  [17]  ?*nd 
Middleton  [8,18,19,20]  have  both  developed  filtered  impulse 
models  which  are  much  more  widely  applicable.  Giordano's  model 
is  similar  to  those  previously  described  and  will  not  be 
treated  further.  Middleton's  model  is  a  more  complete  and  a 
more  general  model  than  those  presented  above. 

Middleton  represents  the  received  noise  process  as 

y (t)  *  x(t)  +  n(t) 

where  x(t)  is  the  impulsive  component  and  n(t)  is  a  Gaussian 
background  (variance  oG2).  He  assumes 

x ( t )  =  [  ( t, e) 

j  J 

where  Uj(t,0)  is  the  waveform  emitted  by  the  j**1  source  after 
having  passed  though  the  receiver.  All  received  noise  pulses  are 
assumed  to  have  the  same  basic  waveform  with  random  variations 
in  scale  and  structure  represented  by  the  random  parameter  j$. 
Middleton  assumes  the  sources  emit  independently  in  time 
according  to  a  Poisson  distribution  with  mean  rate  v.  He 
denotes  the  mean  duration  of  a  pulse  by  T.  The  source  locations 
are  assumed  Poisson  distributed  in  a  space  A  with  coordinates 
Middleton  uses  characteristic  functions  to  derive  the 
density  function  for  the  noise.  He  considers  two  cases: 
narrowband  (noise  whose  spectra  are  narrower  than  the  passband 
of  the  receiver)  and  broadband  (noise  with  spectra  comparable  to 


or  wider  than  the  passband  of  the  receiver).  Note,  however, 
that  both  cases  produce  narrowband  noise  in  the  receiver.  That 
is,  in  both  cases  the  noise  can  be  modeled  by  its  envelope  and 
phase.  For  both  the  narrowband  and  broadband  cases,  the 
characteristic  function  for  x(t)  can  be  written. 


i}jx(u)  =  exp[E  {A  Jq(Bu)  -  A}]  (2.16) 

where  JQ(«)  is  the  0*^  order  Bessel  function  (1st  kind), 

B  =  B(t,X.,£)  is  the  envelope  of  U(t,0)  (which  can  be 
directly  related  to  the  physics  of  the  noise  generation  and 
propagation),  and  A  *  vT  is  the-  first  basic  parameter  of  the 
model.  The  parameter  "A"  is  termed  the  "Impulsive  Index."  It 
measures  the  amount  that  the  noise  pulses  overlap  in  time.  For 
high  levels  of  overlap  (large  A),  the  density  will  approach  the 
Gaussian  distribution.  When  the  amount  of  overlap  is  small 
(small  A),  the  resulting  noise  will  be  highly  impulsive. 

For  narrowband  noise,  several  approximations  can  be  made  in 
Eq. (2.16).  The  resulting  characteristic  function  for  x(t)  is 

®  m  -5!  E  (B  2  }u  2 

*x(u)  e~A  l  £_  e4  (2.17) 

m  =  0 

The  characteristic  function  for  y(t)  is  then 
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(2.18) 


This  function  can  now  be  inverted.  Letting  z  be  the  normalized 
noise  (z  *  v/Vr-ms^  '  Middleton  finds  for  the  probabi  lity 
density  function  of  the  instantaneous  amplitude  in  the 
narrowband  case : 


f(z)se-A  l  e  2°m2  (2.19) 

m=Q  m  •  /z  Trom 

where 

om2  =  5LZAJL_P  (2.20) 

m  1  +  P 

and  "P"  is  the  ratio  of  the  power  in  the  Gaussian  background  to 
that  in  the  impulsive  component 
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Og 
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1/20  e{b2}3 


(2.21) 


It  is  the  second  basic  parameter  for  the  narrowband  model. 
Note  that  Eq.  (2.19)  is  just  a  weighted  sum  of  Gaussian 
distributions  with  increasing  variance  o*.  For  the 
envelope  of  the  received  noise,  the  model  yields 


Prob 


[E>E0]  e~A 
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I  — 

m=0  *• 


(2.22) 


which  is  just  a  weighted  sum  of  Rayleigh  distributions. 
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In  the  broadband  case  the  simplifying  assumptions  for 

Eq.  (2.16)  can  no  longer  be  used.  The  procedure  becomes  more 

difficult  but  is  still  tractable.  One  complication  is  that  the 

resulting  density  has  infinite  variance  for  certain  ranges  of 

the  parameters.  Thus,  to  normalize  the  noise,  the  power  in  the 

Gaussian  background  alone  is  used:  z  -  y/n  .  The  density 

rms 

can  be  written 


-Aa  oo  mm 

f(z)~  le  [  i Z±1  Aa  r(SL°±l)  1F1  ('mot-  1,  z2)  (2.23) 

7T  m=0  m‘  2  2  2 

where  ^F^(  •,«,•)  is  the  confluent  hypergeometric  function  [21], 

r(  •)  is  the  gamma  function  and  a,  A  are  the  two  basic 

a 

parameters  for  the  model.  They  are  both  related  to  the  source 
distributions  and  propagation  laws  for  the  noise.  The  envelope 
in  the  broadband  case  is 

-E0  2  ®  m  m 

Prob[E>E0]  ®  e  [1  -  EQ  2  £  Aa  T(1  +  ®SL)  •  (2.24) 

m=l  »•'  2 

•iFld  -  ft.  2.  E02)J 

Middleton  has  presented  evidence  that  Eqs.  (2.22)  and  (2.24) 
are  accurate  representations  for  the  envelope  of  impulsive 
noise.  Figs.  2.5,  2.6,  2.7,  and  2.8  are  taken  from  [8]. 

Fig.  2.5  shows  a  comparison  between  Middleton's  narrowband 
model  ( A=0. 35 ,  P=  0.5x10  ^)  and  actual  narrowband  impulsive 

_4 

noise  data.  Fig.  2.6  is  a  similar  display  (A=10  ,  P=50)  for 


narrowband  electromagnetic  noise  from  ore  crushing  machinery. 
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Fig.  2.5  -  Comparison  of  a  Measured  Envelope  Distribution 
with  the  Middleton  Model  -  Narrowband  Case 
(from  Spaulding  and  Middleton  [8]) 
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Fig.  2.6  -  Comparison  of  a  Measured  Envelope  Distribution 
with  the  Middleton  Model  -  Narrowband  Case 
(from  Spaulding  and  Middleton  [8]) 
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Comparison  of  a  Measured  Envelope  Distribution  with 
the  Middleton  Model  -  Broadband  Noise  (Man-Made  Noise) 
(from  Spaulding  and  Middleton  [8]) 
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Fig.  2.8  -  Comparison  of  a  Measured  Envelope  Distribution  with 

the  Middleton  Model  -  Broadband  Case  (Atmospheric  Noise) 
(from  Spaulding  and  Middleton  [8]) 


In  both  cases  agreement  between  the  model  and  data  is  quite 
good.  Figs.  2.7  and  2.8  compare  Middleton's  broadband  model 
with  measured  impulsive  noise  data.  Fig.  2.7  shows  broadband 
man-made  noise  (primarily  automobile  ignition  noise)  and 
Fig.  2.8  displays  atmospheric  noise.  Again  close  agreement 
between  the  model  and  measurements  are  found. 

The  most  important  aspect  of  the  Middleton  model  which 

sets  it  apart  from  the  rest  is  that  the  parameters  for  this 

model  (A  and  P  or  a  and  A  )  can  be  determined  explicitly 
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from  physical  considerations.  The  model  can  be  tested  by  the 
data  rather  than  just  fit  to  the  data.  The  chief  problem  with 
this  model  is  the  assumption  of  independent  noise  pulses.  This 
often  is  not.  valid.  Unfortunately,  as  with  the  other  models, 
the  equations  are  intractable  without  this  assumption. 

Another  physical  model  was  proposed  by  Hall  [4].  Although 
not  as  comprehensive  as  the  Middleton  model,  it  deserves 
attention  because  it  is  not  of  the  filtered  impulse  type.  Hall 
was  interested  in  modeling  VLF  atmospheric  noise.  In  order  to 
achieve  a  large  dynamic  range,  he  proposed  a  model  which 
considered  the  received  noise  process  to  be  of  the  form 

y(t)  *  a(t)n(t) 

where  a(t)  is  a  slowly  varying  stationary  random  process  and 
n(t)  is  an  independent,  narrowband  Gaussian  process.  Hall 
selected  a  first  order  distribution  for  a(t)  which  was 
analytically  tractable  and  which  yielded  good  agreement  with 
measured  atmospheric  noise  data.  His  distribution  for  a(t)  is: 


fa(a)  =  — —  - i—  e  0 

o®r(m/2)  la  im+1 

where  m  and  a  are  two  parameters  to  be  chosen.  Using  the 
first  order  distribution  for  the  Gaussian  process, 


fn(n)  *  - i -  e  2oi 

/Tv  a± 

he  calculated  the  first  order  probability  density  function  for 
y(t)  as 


n|) 

Vyl  *  7^17 
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JT  (y2+v2) 
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where 


6  «  !ti  +  1 


For  the  special  case  =  o.  Eq.  (2.28)  reduces  to  the 
density  for  Student's  "t".  Hence,  Hall  named  Eq.  (2.28)  the 
generalized  "t"  distribution. 

As  a  check  on  his  model.  Hall  calculated  the  first  order 
distribution  function  for  the  envelope  and  phase  of  y(t).  He 
obtained 


fE(£) 


(6-1) 


Y6-1 
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_T 

(E2  +  y2)T” 


(2.26) 


(2.27) 


(2.28) 


(2.29) 


and  a  uniform  phase.  For  6  in  the  range  2<e$4,  Eq.  (2.29) 
agrees  well  with  VLF  atmospheric  noise  data.  The  value  0=3 
is  best;  however,  the  density  function  [Eq.  (2.28)]  has 
infinite  variance  for  6=3  and  thus  can  not  represent  a  real 
process. 

Although  Hall's  model  is  simple  enough  to  use  in  detection 
problems  and  is  fairly  accurate,  it  does  not  have  a  strong 
physical  basis.  The  parameters  6  and  y  bave  no  real  physical 
meaning  and  must  be  selected  to  fit  statistical  data  rather 
than  physical  conditions. 


Section  3;  Tractable  First  Order  Non-Gaussian  Noise  Models 


Although  the  models  described  in  Section  2  are  very 
detailed  and  many  are  quite  accurate,  they  are  all  quite 
complicated.  Most  cannot  be  used  in  analytical  treatments  of 
detection  problems.  Those  which  can  are  cumbersome.  There  is 
a  need  for  simple,  first  order  families  of  densities  which  can 
be  used  to  describe  the  first  order  statistics  of  non-Gaussian 
noise.  Three  possibilities  are;  a  generalized  Gaussian  noise 
the  Johnson  Su  System,  and  mixture  models.  These  three 
systems  are  used  throughout  this  dissertation  to  provide 


examples. 
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A  Generalized  Gaussian  Noise 


A  generalized  Gaussian  noise  [22],  which  provides  a  system 
of  density  functions  whose  tails  can  be  made  heavier  or  lighter 
than  that  of  the  Gaussian  distribution,  is  given  by 


cn(  a,  c)  e 
2r(l/c) 


-Cn(  a,  c)  ix  i  ] 


,  c>0 


(2.30) 


where 


ri(o,c)  = 


=  i  [r  (3/c)' 

°  [ra/c)_ 


The  case  c=2  is  the  familiar  Gaussian  distribution  and  cel  yields 
the  Laplace  or  double  exponential  distribution.  The  density 
functions  are  plotted  in  Fig.  2.9  for  values  of  c  in  the  region 
of  most  interest,  0=0.5  to  c=3. 

I 

The  locally  optimum  nonlinearities  for  detecting  a  constant 
signal  in  this  noise  are  given  by 


gloc(x)  *  c[n(o,c)]c  ixic_1  sgn(x)  ,c>0 


(2.31) 


They  smoothly  cover  the  transitions  from  a  blanker,  whose  tail 
region  decreases  as  1/x,  through  the  sign  and  linear  detectors, 
to  an  expander.  The  locally  optimum  detectors  corresponding  to 
the  densities  of  Fig.  2.9  are  given  in  Fig.  2.10.  The  c=2  case 
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is  of  course  the  linear  detector  and  c  =  l  gives  the  hard  limiter 
or  sign  detector.  The  Neyman-Pearson  optimum  nonlinearities 
for  the  constant  signal  case  are 

gopt(x)  *  [r)(o#c)3c  (  I x  I °  -  I x  —  ©  I c )  ,c>0  (2.32) 

These  are  plotted  in  Fig.  2.11  for  the  signal  strength  0=o/2. 

Fisher's  Information  can  be  calculated  by  substituting 
Eq.  (2.30)  into  Eq.  (1-17).  After  integrating  [see  Appendix 
2.1]  one  obtains, 

c2  r(|} 

1(f)  =  -  ,c>l/2  (2.33) 

o2  [r(i)3  2 

c 

=  ®  ,  0<c^l/2 

This  function  is  plotted  in  Fig  2.12.  Since  the  Gaussian 
distribution  minimizes  Fisher's  Information  over  all  fixed 
variance  distributions,  the  curve  has  a  minimum  at  c*2.  The 
peaked  origin  of  the  density  function  accounts  for  the  greater 
information  for  location  for  c<2,  and  the  greatly  diminished 
tails  account  for  the  case  c>2. 

The  Johnson  Su  System 

The  Johnson  Su  System  [23]  can  be  used  to  provide  a 
system  of  heavy-tailed  symmetric  distributions.  It  is  formed 
by  a  memoryless  transformation  of  the  Gaussian  distribution, 


x  =  a 


+  X  sinh1 5 — ' 


(2.34) 


The  resulting  density  functions  can  be  written 


f(x)  = 


-  rk  i  [l  *  C=i>'  1 
*2r  X  L  X 


..  2  ,-1/2  -l/2{  y  +  6sinh'1(^)} 


(2.35) 


The  parameters  a  and  y,  which  affect  location  and  symmetry 
are  set  equal  to  zero.  The  parameters  6  and  X  affect  the 
shape  and  scale  of  the  densities.  The  choice  of 


(2.36) 


yields  densities  of  common  variance  a2.  Thus  the  densities 
can  be  specified  by  the  two  parameters  o2  and  6.  A  few 
densities  from  the  Johnson  System  are  plotted  in  Fig.  2.13. 
Small  values  of  6  correspond  to  heavy-tailed  distributions. 
In  the  limit  as  6  ■*  the  densities  approach  the  Gaussian 
distribution. 

The  locally  optimum  nonlinearities  for  the  detection  of 
constant  signals  are  given  by 
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Fig.  2.15  -  Johnson's  S^  System:  Optimum  Nonlinearities 

(signal  strength  =  -=•  ) 
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The  nonlinearities  corresponding  to  the  densities  of  Fig.  2.13 
are  given  in  Fig.  2.14.  All  locally  optimum  nonlinearities  in 
this  system  are  noise  blankers  since  lim  g.  (x)  *  0  for  all 

XOC 

6  [see  Appendix  2.2].  The  Johnson  System  is  an 
especially  useful  noise  model  since  its  distributions  appear 
Gaussian  in  the  middle  but  have  heavier  tails.  That  is,  the 
locally  optimum  nonlinearities  are  nearly  linear  about  the 
origin,  but  all  become  noise  blankers  eventually.  The 
Neyman-Pearson  optimum  nonlinearities  for  0*c/2  are  plotted 
in  Fig.  2.15. 

A  graph  of  Fisher's  Information  versus  th*»  parameter  6 
is  given  in  Fig.  2.16.  Small  values  of  6,  corresponding  to 
heavy-tailed  distributions  with  large  peaks  at  the  origin,  have 
relatively  high  values  for  Fisher's  Information.  As  6 
increases  o2I(f)  rapidly  approaches  one,  the  value  for  the 
Gaussian  distribution. 

Mixture  Models 

Mixtures  have  been  frequently  used  in  data  analysis  to 
either  add  uncertainty  to  statistical  assumptions  or  to  account 
for  gross  errors  [24,25,26,27].  They  have  also  been  used  in 
detection  problems  to  test  the  robustness  of  standard  detectors 
and  to  aid  in  the  design  of  more  robust  detectors  [28,29,30]. 
The  basic  mixture  density  is 


f(x)  *  (1  -  cJf^Cx)  +  cf2<x) 


0  <  e  <  1 


i 


(2.38) 


The  density  f^(x)  is  the  nominal  density  and  is  very  often 
chosen  to  be  the  Gaussian  density.  The  density  f 2 ( x )  is  the 
contaminant.  It  is  selected  to  fit  the  particular  application. 
Huber  [26]  allows  f2(x)  to  be  a  member  of  the  class  of  all 
distributions  in  his  work  on  robust  estimation.  Others  [24,25] 
have  selected  f2(x)  to  be  Gaussian  with  a  variance  much 
larger  (variance  ratios  between  2  and  100  have  been  used)  than 
that  of  f^(x).  The  percent  of  contamination  c  is  usually 
chosen  to  be  small  (often  less  than  0.1). 

Figs.  2.17-2.24  display  the  densities,  detector 
nonlinearities,  and  Fisher's  Information  for  a  mixture  of  two 
Gaussian  distributions.  For  f^x),  the  Gaussian  density  with 
variance  c^ 2  *  l/(l-e+ey2)  is  used.  The  Gaussian  density  with 
variance  c^^y2^2  is  used  for  f2(x).  Thus,  the  ratio  is 
c2Vci2  =  y2  and  the  overall  variance  is  unity  since 
a2~ (1-e) o^2+eo22=1*  Figs.  2.17,  2.19,  and  2.21  display  the 
densities,  the  locally  optimum  nonlinearities  and  the  optimum 
nonlinearities  for  the  Gaussian-Gaussian  mixture  with  y2*100 
and  various  values  for  c.  Figs.  2.18,  2.20,  and  2.22  are 
similar  graphs  for  e*0.1  and  various  y2  values.  Note  that 
the  high-probability,  low-amplitude  portion  of  the 
nonlinearities  is  linear.  This  region  is  primarily  determined 
by  the  nominal  density,  which  is  Gaussian.  The  low- 
probability,  high-amplitude  portion  is  also  linear,  but  with  a 
different  slope.  This  region  is  primarily  determined  by  the 
contaminating  density.  There  is  also  a  nonlinear  transition 
region  which  connects  these  lines.  Figs.  2.23  and  2.24  display 
Fisher's  Information  versus  e  for  y2  *  100  and  Fisher's 


I  (f) 


Information  versus  y*  for  e  =  0.1.  These  are  not  straight 
lines.  However,  over  the  range  of  most  interest,  the 
relationships  are  nearly  linear. 


Section  4:  The  Detection  Problem  in  an  Impulsive  Environment 


The  detection  problem  introduced  in  Chapter  I  [Eq.  (1.1)] 
can  be  solved  if  the  statistical  characteristics  of  the  noise 
are  known.  The  optimum  detector  is  given  by  Eq.  (1.5)  and  the 
locally  optimum  detector  is  given  by  Eq.  (1.9).  Both  of  these 
detectors  consist  of  a  nonlinearity  followed  by  a  summer  and  a 
threshold  comparator,  as  shown  in  Fig.  1.1.  The  difficulty  of 
course  is  that  the  density  function  for  the  noise  is  not 
perfectly  known.  Equations  have  been  formulated  (see  Chapter 
II  -  Sections  2  and  3)  to  model  noise;  however,  it  must  be 
remembered  that  these  are  only  approximations  to  the  true 
densities.  The  optimum  detectors  for  these  models  will  not 
necessarily  be  optimum  for  the  actual  noise.  Worse  yet,  the 
noise  is  often  non-stationary .  The  optimum  detector  would  then 
have  to  change  to  reflect  the  changes  in  the  noise  statistics. 

Since  optimum  nonlinearities  are  difficult,  if  not 
impossible,  to  design  for  actual  noise,  several  authors 
[29,31,32,33]  have  studied  the  performance  of  different 
suboptimum  nonlinearities.  Bernstein  [31]  calculated  the 
locally  optimum  nonlinearity  for  a  set  of  VLF  atmospheric  noise 
data.  He  reported  that  the  nonlinearity  was  roughly  linear 
near  the  origin  and  that  it  suppressed  high  amplitude 


observations.  Both  limiters  and  blankers  can  be  used  as 
approximations  to  this  type  of  nonlinearity.  Fig.  2.25 
displays  simple  limiter  and  blanker  nonlinearities.  The 
performance  of  limiters  and  blankers  for  various  noise  models 
has  been  studied  [29,31,32,33].  The  noise  blanker  (or  hole 
puncher)  performs  well  for  a  wide  variety  of  densities; 
however,  its  performance  is  very  sensitive  to  the  selection  of 
the  noise  blanking  level.  The  limiter  (or  clipper),  although 
not  quite  as  effective  as  the  blanker,  is  less  sensitive  to  the 
selection  of  its  limiting  level.  Other  more  complex 
approximations  to  the  optimum  nonlinearities  may  be  used  [29]. 
The  better  the  approximation,  the  better  will  be  the 
performance.  However,  the  disadvantages  of  using  a  complex 
nonlinearity  in  the  receiver  may  outweigh  the  increased 
performance. 

Another  approach  to  the  problem  of  detecting  signals  when 
the  statistics  of  the  noise  are  not  well  known  is  through  the 
techniques  of  robust  detection.  Often  the  noise  can  be 
considered  to  belong  to  a  well  defined  class  of  noises.  One 
then  must  look  for  a  detector  which  has  reasonably  good 
performance  over  the  entire  class.  Several  authors 
[30,34,35,36]  have  considered  different  classes  of  mixture 
models.  In  many  cases  mini-max  detectors  have  been  established 
for  the  class.  Some  results  which  facilitate  the  design  of 
mini-max  detectors  are  presented  in  Chapter  III  of  this 
dissertation. 

If  very  little  information  is  available  on  the  statistical 
characteristics  of  the  noise,  it  is  often  wise  to  use 


nonparametric  detection  techniques.  There  are  many  good 
references  on  nonparametric  detection  [37,38,39,40].  These 
techniques  will  not  be  treated  further  in  this  dissertation. 

For  applications  where  the  noise  statistics  are  unknown  or 
time-varying  and  near  optimum  detector  performance  is 
essential,  an  adaptive  detector  may  be  necessary.  Although 
more  complex  than  those  detectors  mentioned  above,  when 
properly  designed,  it  can  be  nearly  optimum  for  a  wide  variety 
of  noise  environments  [41,42,43].  Chapter  IV  presents  an 
adaptive  detection  scheme  which  performs  well  for  several 
different  noise  densities. 


Appendix  2 . 1 


Fisher's  Information  for  the  Generalized  Gaussian 
Distribution  of  Chapter  II  -  Section  3 


Starting  with  Eg.  (1.17)and  substituting  Egs.  (2.30) and 
.2. 31) ,  we  have 
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for  0  <  c  $  1/2.  This  establishes  Eq.(2.33). 


by  L'Hopital's  rule. 
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In  many  detection  problems  the  statistical  characteriza¬ 
tion  of  the  noise  is  not  complete.  For  example,  it  may  only  be 
possible  to  define  a  class  of  probability  densities  which 
contains  the  actual  noise  density.  In  such  cases  minimax 
detectors  may  be  used  to  guarantee  a  lower  bound  on  detector 
performance  for  the  entire  class. 

Once  a  class  of  densities  has  been  specified,  the  worst 
case  density  within  the  class  must  be  identified.  The  best 
detector  for  this  density  is  the  minimax  detector  for  the 
class.  Sections  1,  2,  and  3  of  this  chapter  establish  the  pair 
(*0*Do)  as  the  saddlepoint  of  this  detection  problem.  The 
density  fQ  is  the  member  of  the  class  with  minimum  Fisher's 
Information.  The  detector  is  the  locally  optimum  detector 
for  density  fQ.  In  Section  4  a  few  classes  of  densities  are 
considered  and  relationships  are  established  to  determine  the 
minimax  detector. 
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Section  1:  The  Relationship  Between  Fisher's  Information  and 


the  Asymptotic  Performance  of  Optimum  Detectors 


Fisher's  Information  has  been  associated  with  the 
asymptotic  performance  of  estimators  and  detectors  before. 

Capon  [1]  has  related  it  to  the  efficacy  of  the  locally  optimum 
detector  (see  Appendix  3.1).  Ingram  [2]  related  Fisher's 
Information  to  the  improvement  (measured  as  an  increase  in  the 
signal-to-noi se  ratio)  obtained  with  using  the  locally  optimum 
detector  rather  than  the  linear  detector.  Huber  [3]  related 
Fisher’s  Information  to  the  asymptotic  variance  of  estimators 
in  his  work  on  robust  estimation.  He  concluded  that  the  most 
robust  M-estimate  over  a  given  class  of  densities  is  the 
maximum  likelihood  estimate  for  the  density  which  minimizes 
Fisher's  Information.  El-Sawy  and  VandeLinde  [4]  applied 
Huber's  results  to  detection  problems.  Their  M-detectors 
utilize  Huber's  M-estimates  to  form  the  test  statistic. 

The  results  presented  here  are  different.  Fisher's 
Information  is  related  directly  to  the  asymptotic  probability 
of  detection  for  both  optimum  and  locally  optimum  detectors. 

It  forms  part  of  a  generalized  signal-to-noise  power  ratio  and 
can  be  thought  of  as  a  measure  of  the  difficulty  of  detecting  a 
signal  in  a  given  noise. 

Consider  the  problem  which  was  first  presented  in  Eq.  (1.1), 


H:  x^  =  n^ 


K:  Xi  =  ni  +  8si 


6  >  0,  i  =1 . N 


(3.1) 
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As  before,  the  known  signals  {8s^}^  obey  0  <  <  |s^|  <  <  » . 

N 

The  noise  samples  {n^}^  are  realizations  of  the  independent  random 
N 

variables  {N^}^  with  densites  f^  e  F,  a  class  of  symmetric 
densites  which  will  be  defined  below.  The  Neyman-Pear son 
optimum  detector  is  given  by  Eq.  (1.5)  and  the  locally  optimum 
detector  is  given  by  Eq.  (1.9). 

The  performance  of  these  detectors  is  usually  measured  by 
the  power  function  g(  6)  =  ProbK(decide  K),  and  by  the  level 


a  =  Prob  (decide  K).  For  a  fixed  level  a,  the  Neyman-Pearson 

n 


optimum  detector  maximizes  the  power  function  $(8),  while  the 


locally  optimum  detector  maximizes  the  slope  of  the  power  function 

d£ 


.  The  level  a  is  fixed  by  the 


near  the  origin  (  ^ 
appropriate  choice  of  the  threshold  T.  In  order  to  calculate  the 
power  function,  the  probability  distribution  function  of  the  test 
statistic  [T  fc(x)  of  Eq-  (1*5)  or  Tloc(x.)  of  Eq.  (1.9)]  is 


required  under  both  H  and  K.  This  is  difficult  to  obtain  in 

iN 


general  since  an  N-fold  convolution  of  the  densities  { f ^  ^ 
is  required.  However,  a  few  special  cases  have  been  solved.  The 
performance  of  the  linear  detector  in  Gaussian  noise  is  well  known 
and  the  amplifier-limiter  operating  in  Laplace  noise  has  also  been 
studied  recently  [5]. 

In  detecting  a  small  signal  using  a  large  number  of 
observations,  it  is  usually  possible  to  use  the  Central  Limit 
Theorem  to  show  that  the  test  statistic  is  asympotically  normally 
distributed.  One  simple  form  of  the  Central  Limit  Theorem  [6] 


st  a 


Let  be  a  series  of  independent  random 


variables  having  arbitrary  distributions, 
y1#  •  •  •  ,  mn  and  variances  o2^,  •  .  •  ,  o2^ 
respectively.  Let 
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means 


,  n  =  1 


>  N 


If  o 2 ^  <  ®  for  all  i  and  if  there  exists  positive  constants 

m  and  M  such  that  o2^  >  m  and  E{  lt^-y^  I 3)  <  M  for 

all  i,  then  the  distribution  of  yn  approaches  in  the  limit  the 

unit  normal  distribution.  (In  the  special  case  where  the 

{t^}^  are  identically  distributed,  the  condition  o2^  <  ® 

for  all  i  is  sufficient.) 

Using  this  theorem,  it  is  straightforward  to  show  that 


and 


(3.2) 


(3.3) 


provided  that  the  conditions  on  y^  and  o2^  are  met  under 
both  H  and  K.  These  three  conditions  are  used  to  define  the 
class  of  densities  F  for  which  the  results  of  this  section 
hold.  The  symbol  ■*  signifies  equality  in  the  limit  as  N 
approaches  infinity  and  6  approaches  zero.  The  symbol  $(•) 
represents  the  unit  normal  cumulative  distribution  function. 


Also,  =  EK^i  'xj)-l  and  0  iK  *  Similarly, 

p^pj  and  c2^  are  the  corresponding  means  and  variances 

under  H.  For  the  Neyman-Pearson  optimum  detector  g^(x^)  =  90pt  ^xi 

and  for  the  locally  optimum  detector  g^(x^)  =  9loc  (x^)  [as  defined 

by  Eqs.  (1.6)  and  (1.10)]. 

First  consider  the  Neyman-Pearson  optimum  detector 


T  ( 
opt' 


N  f. (x.-Bs.)  (  > 

x)  =  I  log  ^ - - —  < 

i=l  f<(x<)  (< 


T  decide  K 
T  decide  H 


Assume  that  the  f^  have  infinite  support  with  derivatives  of  all 
order  on  (-*,»>)  and  that  f^(«)  =  f^(-<*>)  =  f^’(oo)  =  £  •(-.*)  =  0 
for  all  i.  In  order  to  calculate  the  power  function  using  the 
Gaussian  approximation  provided  by  the  Central  Limit  Theorem, 
the  mean  and  variance  of  the  single  sample  test  statistic  must 
be  calculated. 

r  l  T  fi(xi"0si) 

yiH  =  EH  L  gopt .  (xi}  J  *  EH  1o9  - ;  7-7" 

1  fi<xi) 

Expand-' ng  in  a  Taylor  series  about  9=0  yields 
piH  =  '  7  e2si2I(fi)  +  04K1(9,si,fi) 


where 


(x)  dx 


is  Fisher's  measure  of  information  for  the  location  parameter  in 
the  density  f^(»),  and  K^(6,s^,f^)  is  a  power  series 
containing  the  remaining  terms  of  the  expansion.  Similarly, 


°1H  =  EH  1o9 


fi(xi) 


wiH 


=  e2si2i(fi)  +  e4K2(e,si,fi) 

where  K2(e,s^,f^)  contains  the  remaining  terms  of  the 
series.  Since  f^(»)  is  assumed  symmetric,  it  can  be  shown 
[7]  that  yiK  =  and  o2iK  =  c2iH-  Using  the 

Gaussian  approximation,  the  threshold  is  chosen  to  fix  a: 


a  =  Prob, 


[Topt<*>  »*]-!-  #  77^ 


T  ’  ,1  "iH 


I  a. 

i-l  lH 


-1  /  N  2  N1/2  N 

T  =  *  1(!-a)  I  ciH  ♦  iyiH 


The  symbol  denotes  approximate  equality.  The  power  function 
is  then  calculated: 


3(6)  =  ProbK[Topt(x)  >  t]=  1  - 


N 

T  -  l  V. 


N  2\l/2 

ill'*)  / 


I  2 ViH  \ 

s  1  *  f  I  $’1(l-a)  +  — - : -  1 


(3.8) 


(3.9) 


(3.10) 


(3.11) 
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2  2  w  2 

In  the  limit  as  ©-►O  and  N-*-®  with  k  «0  Z  s.  I(f.)  *  a  nonzero 

i-1  1  1 


constant,  the  power  function  becomes 


0(6)  =  1  -  «( 4>'  (1-a)  -  k) 


provided  and  cr  are  bounded. 

The  performance  for  the  locally  optimum  detector  is 
calculated  similarly. 


(3.13) 


N  ~f i 1 ( x i  ) 
‘loc(x)  55  I  - 

i=l  fi(Xi) 


>  T  decide  K 
<  T  decide  H 


(3.14) 


The  mean  and  variance  of  the  single  sample  test  statistic  under 
H  are 


PiH  *  £H 


°2iH  *  eH 


'fi' (xi> 

fi(Xi) 


~fi*(Xi)  „  V  =  ..2 


fi(Xi) 


Si2  I ( f i ) 


(3.15) 


(3.16) 


The  mean  under  K  is 


/  "f i ' (xi ) 

^iK  *  eK  (  s 

\  f  4  ( x  4  ) 


\  /*°°f  i  '  (Xi  ) 

i  =  (-Si)  /  -  f  i  (xj-esi  )dx* 

/  f 4  (x<  ) 


(3.17) 
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Expanding  in  a  Taylor  series  about  6=0  and  simplifying  yields 


niK  -  es3  I(fi)  +  e3  J1(e,si,fi) 


(3.18) 


where  J^(6,s^,f^)  represents  the  remaining  terms  in  the 
series.  The  variance  under  K  may  be  treated  similarly 


f i  '  ( ) 
fi(xi) 


=  Si  I  (  f  i  )  +  02  J2(e,si#fi) 


Using  the  Gaussian  approximation. 


(3.19) 


Pr°bH  [Tloc(£)  >  T]  = 


N 

T  -  Z  u  . 


1  °iH 
i-1  iH 


-1  N  2 

~  *  (1-a)  l  s.  l(f.) 

i  =  l  1  x 


=  Prob  [t,  (x)  >  t]s  1 


r  N 

T  -  r  y. 
-  , - i-1 


,iiB« 


(3.20) 


(3.21) 


(3.22) 


Substituting  the  appropriate  expressions  gives 


S  (6)  »  1-9 


*”1(1-Q)  -  k  -  t  Z  J.  (6,s.  ,£,) 

_ K i=l  1  11 

/  e4  n  \  1/2 

(1+ 


(3.23; 


2  2  N  2 

where  k  =  6  £  s.I(f.).  In  the  limit  as  0-*- 0  and  N-k= 

i=l 

holding  k  constant,  the  power  function  is 


6(0)  =  1  -  9(9“'  (l-o)  -  k) 


(3.24) 


provided  and  o^K  are  bounded.  Note  that  this  equation 
for  the  asymptotic  power  function  for  the  locally  optimum 
detector  is  the  same  as  the  equation  for  the  optimum  detector 
[Eq.  (3.13)].  This  is  not  surprising  since  the  Neyman-Pearson 
optimum  detector  approaches  the  normalized  locally  optimum 
detector  as  N-*'®  and  0-K). 

The  expression  [Eq.  (3.13)  or  (3.24)]  for  the  asymptotic 
power  of  optimum  and  locally  optimum  detectors  holds  for  all 
f^  e  F,  where  F  is  a  class  of  symmetric  densities  with  the 
following  properties: 

(1)  f^  has  infinite  support  with  derivatives  of  all  order 
on  (-®,  ®) 


(2)  f  i  ( ®)  =  f  i  ( -®)  *  f  i  ’  ( ®)  *  f  i  ’  ( -®)  *  0 

(3)  There  exists  positive  constants  m^  ano  M2  such  that 

(a)  0<m2<I(fi)<® 


fi’(x)  3 


f j  (x)  dx  <  My  <  ® 


Condition  (3)  is  required  to  satisfy  sufficient  conditions  for 

the  Central  Limit  Theorem  (Appendix  3.2).  In  the  special  case 

N 

where  s^  =  1  for  all  i  and  the  {N^}^  are  identically  distributed, 
that  is  f .  a  f  for  all  i.  Condition  (3)  may  be  replaced  by  the 
weaker  condition: 

(3' )  1(f)  <  • 

The  power  function  [Eq.  (3.13)  or  equivalently  Eq.  (3.24)] 

is  plotted  in  Fig.  3.1  for  various  values  of  a ,  the 

2  2  ^  2 

probability  of  false  alarm.  Note  that  k  *  0  \  sT  Iff.)  is 

i=l  1 

basically  a  signal-to-noise  power  ratio  since 

N  T 

r  0  2  , 

l  0  Si  -  signal  power  parameter 
i  =  l 

and 

r  n  -j  _x 

I  *  noise  power  parameter 

*-i  =  l  J 

N  2  N 

For  the  special  case  when  [  s^/N  =  1  and  the  {n . }.  have 

i=1  2  2 

common  density  f,  one  obtains  k  =6  NI(f).  Thus,  the 

larger  Fisher's  Information  is,  the  larger  the  generalized 

2 

signal-to-noise  ratio  (k  )  will  be,  and  the  larger  the  proba¬ 
bility  of  detection  will  become. 

To  illustrate  the  relationships  between  optimum  detector 
structure,  optimum  detector  performance,  and  Fisher's 
Information  for  the  special  case  of  si  =  1  and  f^  *=  f  for 
all  i,  two  systems  of  densities  will  now  be  considered:  a 


generalized  Gaussian  noise  and  the  Johnson  System  (see 

Chapter  II  -  Section  3  for  a  description  of  these  density 
systems,  graphs  of  their  densities  and  detectors,  and  plots  of 
Fisher's  Information  vs.  the  system  parameters). 

The  power  curves  for  optimum  detectors  operating  in 
generalized  Gaussian  noise  for  several  values  of  the  parameter  c 
are  given  in  Fig.  3.2.  The  power  function  formula  calculated 
above  only  holds  for  c  >  1/2.  For  c  ^  1/2  the  sufficient 
conditions  on  the  Central  Limit  Theorem  are  not  satisfied 
[Condition  ( 3  * )  is  violated].  The  curves  for  small  values  of  c, 
corresponding  to  large  values  for  Fisher's  Information  (Fig.  2.12) 
are  higher  than  those  for  values  of  c  near  2.  The  c  *  1  curve 
is  the  power  function  for  the  sign  detector  operating  in  Laplace 
noise.  The  c  ~  2  curve  is  for  the  linear  detector  in  Gaussian 
noise.  This  set  of  curves  illustrates  the  fact  that  it  is 
easier  to  detect  a  signal  in  additive  noise  which  has  a  large 
value  for  Fisher's  Information  than  it  is  to  detect  a  signal  in 
noise  with  smaller  values  for  Fisher's  Information.  One  could 
consider  Gaussian  noise  to  be  the  most  difficult  noise 
environment  in  which  to  detect  signals  since  Gaussian  noise  has 
minimum  Fisher's  Information  among  all  fixed  variance 
distributions  and  thus  has  a  maximum  value  for  the  noise  power 
parameter  among  all  densities  of  a  given  r.m.s.  noise  power. 

Fig.  3.3  displays  the  power  curves  for  optimum  detectors 
operating  in  noise  from  the  Johnson  Su  System.  Small  values 
of  6,  corresponding  to  large  values  for  Fisher's  Information 
(Fig.  2.16),  yield  higher  curves  than  large  values  of  <5.  As 
6  increases,  the  curves  rapidly  approach  the  power  curve  for 
the  linear  detector  in  Gaussian  noise. 


All  three  of  these  graphs  serve  to  illustrate  the  fact  that 

Fisher's  Information  can  be  used  as  a  measure  of  the  difficulty 

of  detecting  an  additive  signal  in  noise.  Non-Gaussian  noise, 

2 

with  large  values  for  Fisher's  Information,  increases  k  thus 
providing  higher  power  at  a  given  level  than  is  possible  for  the 
Gaussian  distribution.  Impulsive  noise,  with  its  characteristic 
heavy  tails,  has  a  high  value  for  Fisher’s  Information.  The 
optimum  detector  for  this  noise  will  perform  far  better  than  the 
linear  detector  would  in  Gaussian  noise.  This  is  reasonable 
since  limiters  or  blankers  can  be  used  to  suppress  the  heavy 
tails  which  in  effect  will  decrease  the  noise  power  without 
decreasing  the  signal  power.  Hence,  the  increased  structure  of 
non-Gaussian  noise,  if  properly  considered,  can  be  a  great 
asset.  Simulation  results  are  presented  in  Chapter  IV  which 
support  these  statements. 

Although  the  expression  for  the  asymptotic  power  function 

was  derived  assuming  independent  observations,  the  Central  Limit 

Theorem  holds  under  the  weaker  condition  of  strong  mixing  [8]. 

The  expression  will  then  still  be  valid?  however,  the  detector 

in  question  will  no  longer  be  optimum  and  also  the  value  for  k 

will  change  since  the  dependency  structure  will  introduce  cross 

2  2 

terms  in  the  computation  of  a . u  and  aiv- 

In  1 


Section  2:  The  Relationship  Between  Efficacy  and  the  Asymptotic 
Performance  of  Nonlinear  Detectors 


In  Section  1  it  has  been  shown  that  optimum  detectors  for 


densities  with  high  values  for  Fisher's  Information  will  perform 


-88- 


better  than  optimum  detectors  for  densities  with  low  values  for 
Fisher's  Information.  Therefore,  Fisher's  Information  can  be 
used  to  rank  the  densities  within  a  class  of  densities  to 
determine  which  density  provides  the  worst  environment  for 
detection.  However,  in  order  to  prove  that  the  pair  (fQ,D0) 
is  a  saddlepoint  (fo  minimizes  Fisher's  Information  over  the 
class  and  Dq  is  the  locally  optimum  detector  for  fQ),  the 
results  of  Section  1  must  be  generalized  to  account  for  all 
nonlinear  detectors. 

Consider  a  special  case  of  the  problem  treated  in  Section  1 
of  this  chapter. 


H:  Xi  =  n^ 


K:  x-j  =  ni  +  0 


0  ^  0,  l  =1 ,  .  .  .  ,  N 


IN 

The  known  signal  0  is  assumed  constant.  The  are 

assumed  to  be  realizations  of  the  independent  and  identically 

N 

distributed  random  variables  The  noise  is 

distributed  according  to  f,  where  f  e  F  a  class  of  zero  mean 
symmetric  densities  having  finite  Fisher’s  Information. 

The  canonical  nonlinear  detector  for  this  problem  can  be 
written 


(3.25) 


N  {  > 

(x)  =  £  g(xi)  j 

i  =  l  (  < 


decide  K 


decide  H 


(3.26) 


where  g(*)  is  a  memoryless  nonlinearity.  The  threshold  T  is 
chosen  to  achieve  the  desired  probability  of  false  alarm. 
Through  the  proper  selection  of  the  nonlinearity,  this  equation 
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can  represent  both  the  optimum  and  locally  optimum  detectors  for 
the  problem  given  by  Eq.  (3.25).  Note  that  this  detector  is 
both  memoryless  and  non-time-varying.  This  is  because  the 
signal  is  constant  and  the  noise  samples  are  assumed  to  be 
independent  and  identically  distributed. 

As  in  Section  1,  the  Central  Limit  Theorem  can  be  used  to 
calculate  the  asymptotic  power  function  for  the  detector  given 
by  Eq.  (3.26).  When  the  signal  is  absent,  the  mean  and  variance 
of  the  single  sample  test  statistic  are 

PH  *  EHCg(x)]  =  J_”  g(x)f(x)  dx 

°h  *  EHtg2(x)3  “  Ph  =  J.ao92(x)f(x)  dx 

When  the  signal  is  present, 

WK  *  EKCg(x)]  *  £^g(x)f (x-6)  dx  (3.28) 

By  a  change  of  variables  in  the  integrand,  this  becomes 

PK  *  i_00g(x+e)f  (x)  dx  (3.29) 

Also, 

2  *  r  2,  2 
°K  *  EK^9  (x'3  "  PK 

g2(x+e)f(x)  dx  -  (  J°°  g(x+6)f(x)  dx)2 

!\  *“CO 


(3.27) 

“  (  r  g(x)f(x)  dx)2 

•00 


(3.30) 


Using  the  Gaussian  approximation  provided  by  the  Central  Limit 


Theorem,  the  threshold  is  chosen  to  fix  a: 


a  *  Probjj  £t(x)  >  tJ  =j  1  -  * 


T  *  $-1(l-a) M  oH  +  Nvh 


T  -  Nu. 


R7 


(3.31) 


(3.32) 


The  power  function  is  then: 


g  «  Prob 


K  [T(£ 


)  >  T  s  1  -  * 


T  -  Ny, 


(  "a,2)1 


B=  1  -  »  f.-hl-a)  P-- 

\  °K  K 


Now, 


2  /  g*  (x)  f  (x)  dx 


■(B 

-US 


g(x)f(x)  dx 


g  (x+0)f (x)  dx  -  /  g(x+0)f(x)  dx 


Expanding  g(x+6)  and  g*(x+0)  in  a  Taylor  series,  it  can  be 
easily  seen  that 


lim  -S  -  1 


Also, 


8-+Q  \  K 


J g(x+0)f(x)  dx  -  J g (x) f (x)  dx 
/*g2(x+0)f  (x)  dx  -  /  fq(x+Q)f  (x)  dxf 


(3.33) 


(3.34) 


(3.35) 


(3.36) 


(3.37) 


a  constant. 


In  the  limit  as  6  -*■  0  and  N  -►  ®  holding  /5T  0  *  kQ  * 


ft 


ft  e/Eg(f) 


where 


/*" 

‘  2 

/  g'  (x)f(x) 

dx 

_  J-  00 

/•« 

*  T*®  ■  2 

/  g2(x)f(x)  dx  - 

/  g  (x)  f  (x)  dx 

J-cc 

is  the  efficacy  of  the  nonlinear  detector  (see  Appendix  3.3). 

The  asymptotic  power  function  can  now  be  written  as 

0=1-  $(4>“1(l-a)  -  ft  0  /E  (f)  )  ( 

9 

Efficacy  thus  has  a  monotone  relationship  with  the  asymptotic 

performance  of  nonlinear  detectors.  High  values  for  efficacy 

correspond  to  higher  power  at  a  given  signal  strength  than  do 

low  values.  Note  that  for  g(x)  ■  -£'(x)/f(x)  (the  locally 

optimum  detector),  one  finds  E  ( f )  *  1(f)  (Fisher’s  Informa- 

9 

tion),  and  Eq.  (3.40)  reduces  to  the  result  from  Section  1 
[Eq.  (3.24)]. 

Section  3:  The  Saddlepoint  for  the  Minimax  Problem 

The  results  of  Section  2  may  be  used  to  prove  that  the  pair 
(f  ,D  ),  where  f  minimizes  Fisher’s  Information  over  the 


3.38) 


3.39) 


3.40) 
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given  class  of  densities  and  Dq  is  the  locally  optimum 

detector  for  density  fQ,  is  a  saddlepoint  for  the  minimax 

detection  problem  pre.viously  described.  Let  f  be  contained  in 

F,  a  convex  set  of  densities,  and  let  g  be  any  antisymmetric 

detector  nonlinearity.  Also  let  gQ  be  the  locally  optimum 

nonlinearity  for  density  fQ  (i.e.  gQ(x)  *  -fQ‘ (x)/fQ(x) ) . 

If  f  and  g  satisfy 
o  3o 


E  ( f  ) 
g  o 


u0>  , 


Vf> 


(3.41) 


for  all  f  e  F  and  all  g  antisymmetric,  then  (f0»DQ)  *s  a 
saddlepoint.  That  is,  any  g  other  than  gQ  will  decrease  the 
efficacy  (and  thus  the  power  function)  when  fQ  is  the  noise* 

Also,  any  other  noise  f  will  increase  the  efficacy  when  gQ  is 
used  for  detection.  Thus  fQ  is  the  "worst  density"  and  gQ  is 
the  best  detector  for  this  density.  This  is  the  minimax  solution. 
The  detector  is  chosen  for  minimum  error  using  the  density  which 
maximizes  error.  This  problem  is  related  to  the  minimax  estimation 
problem  of  [9]. 

The  proof  of  Eq.  (3.41)  is  similar  to  one  used  by  Huber  [10] 
in  his  work  on  robust  estimation.  His  equation  for  the  asymptotic 
variance  of  an  M-estimate  is  similar  to  the  equation  for  efficacy. 

For  g(x)  antisymmetric,  efficacy  can  be  written 


The  relationship  Eg(fQ) 


5  (f  ) 

g0  ° 


follows  from  the  fact 


that  the  nonlinear  detector  which  maximizes  efficacy  is  the  locally 


optimum  detector  L11J«  To  prove  E  (f  )  E  (f),  first 

°o  ’o 

note  that  E  (£)  is  a  convex  function  of  f  (See  Appendix  3.4) 
3d 

Then  let  ffc  *  (l-t)f0  +  tf^  where  f^  c  F.  Now, 


It  Eg  <V 


(3.43) 


if  and  only  if  E  (f)  is  increasing  in  all  directions  from  fQ. 

o 

Which,  by  the  convexity  of  Eg  (f)  on  the  convex  set  of  densities 

F,  means  that  £  must  minimize  E  (£).  By  a  straightforward 

°  % 

computation, 


*  Vft> 


[“-''/Jo' 


(x)fQ(x)  dx  +  t 


■fso 


(xjf^x)  dx 


(1-t )J  gQ2(x)f0(x)  dx  +  ty^gQ2 (x) fj (x)  dx 


2jfeo  <x)£o<x)  dx  r  -  . 

~r~, - ho  (x)  [£i(*>  -  £o(x>  ] dx 

7902(x)f0(x)  dx 


(3.44) 


[/V(x)£o(x)  dx]  r  J  r  . 

FT*-; - U)  [fi(x)  •  £olx,]dx 


[/|0J<*)£o(x>  a*] 


To  simplify  this  expression,  note  that 


r*  /*"  /f«’<x)\2l 

/  gj(x)f.lx)  dx«  /  (-1)1-2 - [  — - }  I  f_( 


x)  dx 


-00 


where  I(fD)  is  Fisher's  Information  for  density  fQ.  Thus, 


*  Vft’ 


-  f  Ul0‘  (X)  -  902(x>] 

r  f1(x)  -  fG(x)i 

t«o  •'*"  L  J 

L  J 

(3.47) 


Now  this  equation  is  equivalent  to  the  condition  necessary  for  the 
minimization  of  Fisher's  Information.  That  is, 


eft  J<ft> 


t=0 


d 

dt 


/ 


hn 


t)fD’  (x)  +  tf 


dx 


(l-t)f0(x)  +  tfx(x) 


t*0 


(3.48 


/ao  _  qq 

(x)  -  fc'  <x>  J  dx  -j  g02(x)  [ 


90  <x)  |  f,  (x) 


>] 


f0  (x)  |  dx 


Integrating  the  first  integral  by  parts  leaves 


So  1(f)  will  be  minimized  at  f  «  f  if  and  only  if  Eq.  (3.49)  is 

nonnegative.  Thus,  E  (f)  is  minimized  at  f  if  and  only  if 

90  o  1 

1(f)  is  minimized  at  fQ,  which  occurs  if  and  only  if 

/-IC290’(x)  "  9o(x)][f1(x)  -  fQ(x)]  dx  iO  ( 

for  all  f,  e  F.  Therefore,  E_  (f_)  $  E_  (f)  when  f  minimizes 

90  °  90  ° 

Fisher's  Information  over  the  class  F.  Eq.  (3.41)  holds  and  the 
pair  (fo,DQ)  is  a  saddlepoint. 


Section  4:  Minimax  Detectors  -  Some  Examples 

Sections  1,  2,  and  3  have  established  the  fact  that  given  a 
convex  set  of  densities  F,  the  minimax  detector  for  F  is  the 
locally  optimum  detector  D  for  the  density  f  which  minimizes 
Fisher's  Information  over  the  class.  Since  1(f)  «  E(-f’/f)*f)  is 
a  convex  function  in  f  (see  Appendix  3.4)  there  exists  a  density 
with  minimum  Fisher's  Information.  Huber  [3]  has  shown  that  this 
density  is  also  unique.  In  this  section  several  different  classes 
of  densities  are  considered  and  conditions  are  established  for  the 
minimax  detectors. 

First  M  Moments  Fixed 

Consider  the  class  consisting  of  all  densities  which  have  the 
values  ^or  the  first  M  moments.  That  is,  a  density 

f(x)  in  this  class  must  satisfy 


£"xmf(x)  dx  *  ym  m  =  0,1,2, ...,M  ( 

where  p  ■  1.  The  density  with  minimum  Fisher's  Information  in 
this  class  may  be  found  using  the  Euler-Lagrange  equation 

3L  _  d /3L \  n 

TT  £k\7t')m  0 

with  ,  .  2 

L(x,f ,f ' )  -  (IH2LL  )  f(x)  +  i  X  x^f  (x) 

V  f (x)  /  m*0  m 


A  straightforward  calculation  yields  the  following  second  order 
nonlinear  differential  equation 


/  f  '  (x)  \2 
\  f(x)  / 


♦  ?  X  «  0 

f(x)  Bl*0 


The  density  which  satisfies  Eq.  (3.53)  and  the  constraints  given 
by  Eq.  (3.51)  is  the  density  with  minimum  Fisher's  Information. 
The  locally  optimum  detector  for  this  density  is  the  asymptotic 
minimax  detector  for  the  class. 

As  an  example,  consider  the  case  M  *  2  with  * 

2  2 
p2  *  o  (i.e.,  all  densities  with  fixed  variance  c  )• 

Eq.  (3.53)  becomes 


3.  51) 


3.52) 


(3.53) 


0  and 


with  the  constraints 


J  !f(x)  dx  =  1 

J_*x2f  ( x )  dx  =  02 

These  equations  are  satisfied  by  the  Gaussian  distribution.  The 
linear  detector  is  the  minimax  detector  for  the  class.  It 
guarantees  the  lower  bound  of  l/c2  for  the  efficacy.  The 
following  table  displays  the  efficacy  for  both  the  linear  detector 
and  the  locally  optimum  detector  for  some  noise  f^. 

Efficacy 

Gaussian  noise  (q2«l)  non-Gaussian  noise  fi  (o2=l) 

linear  1  1 

detector 

locally 

optimum  some  1  Iff^)  fe  1 

detector  amount 

(“fl'/fl) 

Use  of  the  linear  detector  guarantees  an  efficacy  of  unity.  If 
another  detector  is  used,  the  efficacy  for  Gaussian  noise  will  be 
less  than  unity.  However,  when  the  noise  f^  for  which  the 
detector  was  designed  is  present,  the  efficacy  will  be  given  by 
Fisher's  Information  (see  Section  2)  and  could  be  much  greater 
than  unity.  If  one  either  has  some  prior  knowledge  of  the  noise 
or  is  willing  to  accept  a  decreased  performance  for  Gaussian 
noise  then  one  could  do  far  better  than  the  linear  detector,  even 


though  it  is  the  most  robust  (in  the  minimax  sense)  for  this 


class.  A  conclusion  which  can  be  drawn  is  that  this  class  of 
fixed  moment  densities  may  not  be  very  realistic. 

A  Class  of  Mixtures 


Perhaps  a  more  useful  class  of  densites  is  given  by  the 
mixture 


f(x)«(l-e)g(x)+eh(x)  ,  0  <  e  <  1  ( 


where  g(x)  is  some  known  symmetric  density  (with  -log  g(x) 
convex)  and  h(x)  is  an  unknown  contaminant  which  is  assumed 
symmetric.  Huber  [3]  has  shown  that  the  density  fQ(x)  with 
minimum  Fisher's  Information  in  this  class  is  given  by 


!(1  -  e)g(x)  ,lxl  «  a 

(1  -  e)g (aje”^1  x  ,-a)  ,lxi  >  a 


where  [-a, a] 


is  an  interval 


/.:■ 


g(x)  dx  + 


such  that 

2g(a)  , 

k 


ig ' (x)/g(x)  i 
1 

1  -  e 


$  k  and 


( 


( 


A  proof  of  this  result  is  given  in  Appendix  3.5.  When  g(x)  is 
the  Gaussian  distribution,  the  locally  optimum  detector  for 
density  fQ(x)  is  an  amplifier  limiter.  It's  performance  has 
been  studied  by  Martin  and  Schwartz  [12].  Kassam  and  Thomas  [13] 


.55) 


.56) 


3.57) 
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have  studied  the  various  limiters  which  arise  when  g(x)  is  a 
member  of  the  generalized  Gaussian  distribution  defined  in 
Chapter  III  -  Section  3. 

Other  Possibilities 

Huber  [14]  has  developed  a  technique  to  determine  the 
distribution  with  minimum  Fisher's  Information  which  will  fit  a 
set  of  measured  points.  Given  k  £  2  points,  Huber  has  found 
that  there  is  a  unique  function  which  has  a  minimum  value  for 
Fisher's  Information  among  all  distributions  which  pass  through 
the  points.  This  function  can  be  obtained  by  spline  interpola¬ 
tion.  Let  F(q)  *  ti,  i  =  l,...,k  be  the  k  points.  Assume  that 
£o  =  -<*>,  tQ  =  0  and  £k+1  *  ®,  fck  +  1  *  1*  Then  the  distribution 
Fq(x)  with  minimum  Fisher's  Information  which  passes  through 
these  points  must  satisfy  the  following  four  conditions: 

(1)  Fr(q)  *  t.,  i  «  0,1 . k+1 

(2)  Fq  is  two  times  continuously  differentiable 

(3)  f  (x)  ■  F  '  (x)  >  0  except  over  intervals 

o  o 

Cq,q+l]  where  t.  -  t-+1 
(in  such  cases  Fq’(x)  *  0) 

(4)  On  each  interval  (  /f^  )"/v^  *  X^  *  a  constant 
Also,  the  value  for  Fisher's  Information  will  be 

k 

I(f0)  =  -4  [  (ti+1  -  t i ) X i 

i=0 


(3.58) 


This  result  could  be  very  useful  for  detection  systems. 

Initially  measurements  would  be  made  and  the  density  which 
satisfies  conditions  (1)  through  (4)  would  be  found.  The  optimum 
detector  for  this  density  should  perform  well  for  the  actual 
noi se . 

Two  other  classes  of  densities  in  which  the  member  with 
minimum  Fisher's  Information  has  been  found  are:  The  p-point 
class  and  Huber's  e-normal  class.  The  p-point  class  [4]  is 
defined  as  the  set  of  all  symmetric  distributions  which  are 
continuous  at  +  a  and  which  satisfy  f(x)  dx  *  p  for  some 
fixed  a  and  p.  Huber's  e-rormal  class  [3,10]  is  the  set  of  all 
distributions  which  differ  by  at  most  e  in  Kolmogorov  distance 
from  the  standard  normal  cummulative  distribution  function: 
supiF(x)  -  <t>(x)  i  ^  e.  Both  of  these  classes  may  be  used 
for  minimax  detection. 
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Appendix  3.1 


The  Relationship  between  the  Efficacy  of  the  Locally  Optimum 
Detector  and  Fisher's  Information 


The  efficacy  of  a  detector  is  given  by  Eq.  (1.15)  as, 

2 


.  Um  [re  I  e— o] 

N—  NVarH(T<x)) 


N  -^(X.) 

For  the  locally  optimum  detector  T(x)  *  Z  -  s 

"  1=1  fi(xi> 


From  Eqs.  (3.16)  and  (3.18), 


N  N  _  _  N 

ek  [t (x)]  =  z  v  =  e  z  s .  i (f  • )  +  e z  j.  (e,s.  ,f . ) 

*  ~  i=i  1K  i=i  1  1  i_i  1  11 


N  N  2 

Var„  [T  (x)]  =  I  of  =  Z  s/l(f .) 

H  ~  i=l  lH  i=l  1  1 


Thus , 


1  **  2 

E.  =  lim  ±  Z  s/l(f.) 

loc  N—  N  1*1  1  1 


1  N  2 

For  the  special  case  of  =?  Z  s.  *  1  and  f.*f  for  all  i, 

N  i-1  1  1 


Eloc  "  I(f) 


J 


2- 


Appendix  3.2 

Satisfying  the  Sufficient  Conditions  for  the  Central  Limit  Theorem 


For  the  Central  Limit  Theorem  to  hold,  it  is  sufficient  that: 

(1)  o\  <  - 

(2)  o?  >  m  >  0 

(3)  E{|gi(xi)-yil 3>  <  M  <  ® 

for  all  i,  under  both  H  and  K.  In  the  special  case  of  f^*f  and 
s^l  for  all  i,  Condition  (1)  alone  is  sufficient. 

First  consider  the  Neyman-Pearson  detector. 


'iH 


iK 


e4K2(e,si,fi) 


For  sufficiently  small  6,  Condition  (1)  is  satisfied  by  requiring 
|si|<  «  and  I(f^)<  ®.  Condition  (2)  is  met  if  Is^J  >  m^  >  0  and 
I(fi)  >  m2  >  0.  Utilizing  a  Taylor  series.  Condition  (3)  can  be 
simplified 


log 


fi(xi-6si) 

w 


log 


w 


3 

I  + 


fi(xi) 


fi(Xi) 


f.(x.)  dxA  +  {higher  order  terms) 


Thus,  for  sufficiently  small  6,  condition  (3)  can  be  satisfied, 
under  both  H  and  K,  by  requiring  |s^|  <  <  ®  and 
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For  the  locally  optimum  detector, 


°iK  "  s/ufj.)  +  e2J2(6,si,fi) . 


For  sufficiently  small  6,  Conditions  (1)  and  (2)  can  be  satisfied 
by  requiring  0  <  <  J  |  <  •  and  0  <  m2  <  l(f^)  <  •  just  as  in 

the  Neyman-Pe arson  case.  The  left  hand  side  of  Condition  (3)  under 
both  H  and  K  can  be  written 


-w 

fi<*i> 


si  -  *i  <  lsi>  E 


vv  3 

fi{xi) 


+  l^il 


3  /•»  If.  (x. )  3 
^  |s  |  /  _i_±_  f  (x  >  dx  ♦ 

1  Iw  1  1 


(higher  order  terms] 


Thus,  Condition  (3)  can  be  satisfied  if  |  si  |  <  <  «  and 

r  i«](xji3 


f  fi(xi> 

I  P— 1 -  W  **i  '  *j 

•L  1  1  1  ‘ 


The  Efficacy  of  the  Nonlinear  Detector 


The  efficacy  of  a  detector  is  given  by  Eq.  (1.15}  as. 


E_  -  lim 
N-*— 


1  m  .  “ _ _ 


NVar„  [T  (x)] 


Using  the  nonlinear  detector  of  Eq.  (3.26), 

N 

T (x)  *  I  g(x.) 
i=l  1 

ek[T(x)]  *  NEK[g(x)]  *  N  /  g  (x+6 )  f  (x)  dx 

Aew[t(x)]  =n  f  Ag(x+e)|  f  (x)  dx 

d3  K  "  e-o  ( e-o 

*  N  f  g'  (x)  f  (x)  dx 


Also, 


VarH[T(x)]  -  NVarH  [g(x)] 


K[l 


g  (x)f(x)  dx 


*  (/. 


g (x) f (x)  dx 


Thus,  for  the  nonlinear  detector, 


ET  "  Eg<f>  ' 


£  y*  g* (x)f (X)  dx  J 
g‘(x)f(x)  dx  -|  J  g(x)f(x)  dxj 


For  the  special  case  g 


f '  (x) 


,  E  (f )  *  1(f)  as  in  Appendix  3.1 


A  Proof  that  Efficacy  is  a  Convex  Function 
of  f  when  g  is  Antisymmetric 


Efficacy,  for  g(x)  antisymmetric,  is  given  by  Eq.  (3.42)  as, 

i2 


E  (f ) 


[f 3'  (x)  f  (x)  dxj 


Let 


and 


P 

—  oo 

v(f)  « J 


g  (x)f(x)  dx 


0(f) 


(x)f(x)  dx 


g  (x)f(x)  dx 


Note  that  U"(f)  *  V"(f)  *  0  and  V(f)  >  0  for  all  but  degenerate 
cases.  Thus, 


E  (f)  »  W(f)  «  C— 1 ’fU, 
9  V(f) 


is  a  convex  function  of  f  because 


W"  (f) 


2  [u  1 V  -  PvQ2 


5-  0 


In  particular  1(f)  *  E  j, (f)  is  convex  because  the  locally  optimum 
detector  is  antisymmetrFc. 


Appendix  3.5 


A  Proof  that  f  (x)  of  Eq.  (3.56)  Minimizes  Fisher's 
Information  for  tne  Mixture  Class  Specified  by  Eq.  (3.55) 


This  proof  is  similar  to  one  given  by  Huber  [10].  First  fQ(x) 
is  proven  to  belong  to  the  mixture  class.  Then  it  is  proven  that 
fQ(x)  minimizes  Fisher's  Information  over  the  class.  From 
Eqs.  (3.56)  and  (3.57) 


fo(x) 


{ 


(1  -  e)g(x) 


*  I  x  I  ^  a 


(1  -  Og(a)e-k(lx!-a),|x|  >  a 


where  [-a, a]  is  an  interval  such  that 


g'  (x) 
gTxT 


•$  k  and 


L 


g ( x )  dx  +  ^2<a)  -  1 


rc^rr 


-a 


The  class  is  defined  as  all  densities  of  the  form 


f(x)  ■  (1  -  c)g(x)  +  eh(x) 


where  g(x)  and  h(x)  are  symmetric  densities,  0  &  e  <  1,  and  -log  g(x) 


is  convex.  The  density  fQ(x)  is  an  element  of  the  class  if 


ho(x>  -  I  Vx)  -  11  I  C> 


g  (x) 


{" 


x  ^  a 


.  x  >  a 


(1  -  e)  „,,^-*(|xl-a)  .  (1.1,0 


a  lx) 


is  a  density.  To  show  hQ(x)  0,  note  that  since  -log  g(x)  is 
convex,  it  lies  above  its  tangents  at  ±a. 

-log  g  (x)  i  -log  g(a)  +  ^ - ^(x  -  a)  ,x  >  a 

-log  g (x)  *  -log  g(-a)  +(  -  (x  +  a)  ,x  <  -a 

\  g(-a)  / 

Also'  ~  frif1  “  k  and  "  fFif1  “  "k‘  So  that' 


-log  g (x)  »  -log  g(a)  +  k(|x|  -  a) 


Thus , 


g(x)  «  g(a)e-k(lxl-a' 


»  x  >  a 


and 


(1  ' -  g(a)e’k(lxl‘a)  -  (1  ~-c)  g(x)  *  0  for  |x|  >  a 


Hence  hQ(x)  >  0.  Also, 


/. 


hQ(x)  dx 


(1 


x  >a 


x  >a 


g (a)  f  e"k{lxl'a)  dx  -  (1  ~  f  g(x)  dx 

- 1 ..  i  ^  _  -/i  __  i _ 

H 

-]  - 


g (x)  dx 


(1  -  c) 
c 


=  1  _  (1  -  e)  . 


Thus  hQ(x)  is  a  density  and  fQ(x)  is  contained  in  the  mixture  class 


of  interest. 


From  Eq.  (3.50),  fQ(x)  minimizes  Fisher’s  Information  if  and 
only  if 


J  -  J  I2g0’(x)  -  gc2(x)  ]  [f^x)  -  fQ(x)]  dx  *  0 
for  all  f^(x)  in  the  class.  Now, 


-  g'-l*] 

gTxP 


»  x  ^  a 


gQ(x) 


k  sgn(x)  , |x|  >  a 


go*  (x)  = 


(-  fuf )  •1*1 


^  a 


x  >  a 


-log  g(x)  convex  => 


/.  g'  (x)  V 

V  gT xT  ) 


>  o 


g0’  (x)  >.  o 


k2  -  g02(x)  *  0  and 

For  | x |  >  a,  go' (x)  =  0 


Also,  |-^y  -  4  k  for  | x |  ^  a.  Thus, 

2gQ'  (x)  -  g02(x)  +  k2  >,  0  for  |x|  ^  a. 

and  g02(x)  »  k2;  thus  2gQ' (x)  -  go2(x)  +  k2  ■  0.  Then,  adding 

2 

and  subtracting  k  from  inside  the  integrand 

J  -  f  I2g0’(x)  -  g02(x)  ♦  k2](f1(x)  -  fQ(x)]dx  -  k 2f  [f^  (x)  - 
•'-a  J-cd 


For  |xj  s  a,  f1  (x)  -  fQ(x)  «  ch^(x)  l  0.  Also,  the  second 
integral  equals  zero.  Thus  J  >,  0  and  f0(x)  minimizes  Fisher's 
Information  over  the  entire  class. 


fQ (x) ]dx 
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AN  ADAPTIVE  DETECTOR  FOR  SIGNALS  IN  NON-GAUSS IAN  NOISE 


Adaptive  detectors  are  often  used  when  near  optimum 
performance  is  required  for  a  wide  variety  of  noise 
environments.  They  are  often  more  complex  than  non-adaptive 
detectors;  however,  they  are  inherently  capable  of  handling 
noise  whose  statistics  are  unknown  or  time-varying.  In  this 
chapter  an  adaptive  detector  is  presented  and  simulation 
results  are  given  which  indicate  that  this  detector  will 
perform  well  for  several  different  background  noises.  Section  1 
starts  with  a  description  of  the  detector.  Different  measures 
of  density  function  tail  behavior  are  then  explored  to  determine 
an  algorithm  which  may  be  used  to  adapt  the  detector.  In  order 
to  test  the  detector' 6  performance,  it  is  implemented  on  a 
digital  computer.  Section  2  presents  the  results  of  this 
simulation. 


Section  1:  Development  of  an  Adaptive  Detector  using  the 
Johnson  System  as  a  Noise  Model 

Figure  4.1  displays  the  basic  structure  for  an  adaptive 
detector  which  can  be  used  for  the  hypothesis  testing  problem 
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described  in  Chapter  I.  Information  is  extracted  from  the 
observations  in  order  to  select  a  nonlinearity  to  be  used  for 
the  detection.  If  one  has  a  reasonable  model  for  the  noise, 
with  perhaps  a  few  unknown  parameters,  the  selection  of  the 
nonlinearity  becomes  less  difficult.  The  parameters  may  be 
estimated  from  the  observations  to  determine  the  density 
function  within  the  model  which  most  closely  approximates  the 
noise.  The  optimum  nonlinearity  for  this  density  should  be 
close  to  optimum  for  the  actual  noise. 

The  Johnson  Su  System,  presented  in  Chapter  II  -  Section  3, 
provides  a  useful  noise  model.  It  was  chosen  because  it  is  a 
relatively  simple  parameterized  family  of  densities  which 
possesses  noise  blankers  for  the  optimum  detector  nonlinearities 
(see  Fig.  2.15  and  Appendix  2.2).  These  nonlinearities  are 
nearly  linear  about  the  origin.  However,  large  observations  are 
heavily  suppressed.  The  Johnson  System  for  zero  mean  symmetric 
distributions  [Eq.  (2.35)]  can  be  described  by  two  parameters, 

6  and  X.  The  shape  parameter  6  is  directly  related  to  the 
tails  of  the  density.  The  scale  parameter  X  can  be  related  to 
the  variance  c2  by  the  following  equation 


“(• 


A 


In  order  to  use  the  Johnson  System  nonlinearities,  an  adaptive 
detector  must  estimate  the  parameters  6  and  X.  Given 
estimates  of  6  and  o2,  X  can  easily  be  estimated  using  Eq.  (4.1). 


The  shape  parameter  6  can  be  estimated  by  utilizing  its 
relationship  to  tail  behavior.  Three  measures  of  density 
function  tail  behavior  will  now  be  described  in  order  to 
facilitate  the  selection  of  the  appropriate  measure  for  the 
estimation  of  6. 


Measures  of  Density  Function  Tail  Behavior 


It  is  desirable  that  a  measure  of  tail  behavior  be 
independent  of  location,  be  independent  of  scale,  and  exist  for 
all  distributions.  The  most  common  measure  of  tail  behavior, 
the  standardized  fourth  central  moment  defined  by 


E(x  -  y)4 
[E(x  -  y) 2]2 


when  y  =  E(x) 


possesses  the  first  two  of  these  properties.  However,  any 
simple  function  of  moments  cannot  exist  for  all  distributions. 
This  leads  to  the  use  of  functions  of  percentage  points  as  a 
measure  of  tail  behavior.  One  such  measure  [1]  is 


R(Pl> 

R(P2) 


where  R(p)  *  F”'i(p),  0  f  p  $  1,  and  F(»)  is  the 
cumulative  distribution  function.  The  percentage  points  p^ 
and  p2  are  chosen  on  the  tail  and  on  the  shoulder  of  the 
density,  respectively.  That  is,  the  point  p^  is  selected  so 
that  the  p.  quantile,  x,  *  R(p  ),  increases  monotonically 
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as  the  tail  becomes  heavier.  The  point  is  chosen  so  that 
the  p2  quantile,  x2  *  R(p2),  remains  relatively  stationary 

regardless  of  tail  behavior.  The  measure  t  exists  for  all  ® 

reasonable  distributions  and  is  independent  of  both  location  and 
scale • 

Another  useful  measure  of  tail  behavior  [1]  is  '• 


R' <Pi) 

p  =  - 

R(Pi> 

where  p^  is  chosen  on  the  tail  of  the  density.  This  p 
possesses  many  of  the  same  properties  as  does  t.  Its  major 
advantage  over  t  is  that  the  point  on  the  shoulder  of  the 
density,  which  may  be  difficult  to  choose,  is  not  necessary. 

The  points  p^  and  P2  are  chosen  on  the  tail  and  on  the 
shoulder  of  the  Johnson  density  system.  The  inverse  of  the 
cumulative  distribution  function  for  the  Johnson  System  is 


(4.4) 


x  *  f"1(p)  -  Xsinh  ^ 


(4.5) 


where  4>_1(»)  is  the  inverse  of  the  unit  Gaussian  cumulative  A, 

distribution  function.  Figure  4.2  displays  x  versus  p  for 
several  values  of  the  parameter  6  (A  is  selected  by  Eq.  (4.1) 

so  that  a2  remains  constant  .  For  values  of  p  i  .995,  as  6  . 

decreases  so  that  the  tail  becomes  heavier,  the  quantile  x 
increases.  The  larger  p  is,  the  easier  it  becomes  to 


differentiate  between  the  different  values  of  6.  For  this 
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reason  and  to  account  for  detectors  operating  at  a  low  false 
alarm  rate,  the  point  p^  *  .9999999  was  chosen.  In  order  to 
choose  p2  on  the  shoulder  of  the  system,  note  that  for  values 
of  p  between  approximately  .975  and  .98  the  quantile  x  is 
relatively  insensitive  to  the  parameter  6.  In  this  study  of 
tail  behavior  the  point  p2  *  .98  was  selected. 

The  standardized  fourth  central  moment  82  and  the  tail 
measures  t  and  p  for  the  Johnson  System,  normalized  by  their 
values  for  the  Gaussian  distribution,  are  displayed  in  Fig.  4.3. 
All  three  measures  increase  monotonically  as  the  tails  become 
heavier  (6  decreases).  Any  one  of  them  could  be  used  to 
estimate  a  value  for  6  given  a  set  of  observations.  Fig.  4.4 
shows  the  relationships  between  t  and  $2  and  between  p 
and  g2«  Assuming  the  Johnson  System  model  held,  one  could 
measure  whichever  parameter  was  easiest  and  then  calculate  the 
remaining . 

Due  to  the  importance  of  Fisher's  Information  in  estimation 
and  detection  problems  (see  Chapter  III),  the  relationships 
between  Fisher's  Information  and  the  various  tail  measures  were 
explored.  Fig.  4.5  displays  o2I(f)  versus  f}2,  t  and  p 
(normalized).  Given  a  value  for  either  82*  t,  or  p  and 
assuming  that  the  Johnson  System  noise  model  is  a  good 
approximation  to  the  actual  noise  environment,  one  could  obtain 
a  unique  value  for  Fisher's  Information  from  Fig.  4.5.  This 
procedure  would  probably  be  easier  than  estimating  Fisher's 
Information  directly  for  an  unknown  noise  distribution. 

The  usefulness  of  these  three  tail  measures  for  a  different 
noise  model,  the  generalized  Gaussian  noise  model  presented  in 
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Chapter  II  -  Section  3,  was  also  considered.  Fig.  4.6  shows 
&2>  t  and  p  versus  the  density  system  parameter  c.  As 
for  the  Johnson  System,  the  three  measures  increase  monoton- 
ically  as  the  tail  becomes  heavier  (c  decreases).  Through  these 
simple  relationships  a  value  for  c  could  easily  be  estimated  by 
measuring  either  &2>  T  or  P*  Fig.  4.7  displays  the 
monotone  relationships  which  enable  one  to  estimate  $2>  T 
and  p  from  each  other.  A  plot  of  Fisher's  Information  versus 
&2>  t  and  p  is  given  in  Fig.  4.8. 

An  additional  use  for  these  tail  measures,  which  was  only 
briefly  explored,  would  be  as  tools  for  the  comparison  of  noise 
models.  Various  densities,  whose  tail  behavior  are  identical  as 
measured  by  either  {Jj*  T  or  P»  could  be  compared  in 
terms  of,  for  example,  Fisher's  Information  or  efficacy  with 
various  detectors. 

Implementation  of  the  Johnson  System  Adaptive  Detector 

Any  one  of  the  three  measures  of  density  function  tail 
behavior  could  be  used  to  estimate  the  Johnson  System  shape 
parameter  6*  The  measure  t  was  selected  because  it  appeared 
to  be  the  easiest  to  estimate  from  data.  One  need  only  measure 
two  quantiles  and  take  the  ratio.  The  measure  62  would 
require  taking  a  fourth  moment.  This  would  probably  be  more 
difficult  in  most  applications.  Although  p  requires  only  one 
quantile,  the  rate  of  increase  in  the  quantile  would  also  have 
to  be  calculated.  This  could  be  achieved  by  measuring  a  few 
points  above  and  below  the  quantile.  This  method,  however,  may 


i 

1 

1 


not  be  very  accurate. 


The  point  p^  *  .9999999,  selected  previously  for  the 
study  of  the  tail  measures,  can  no  longer  be  used.  This 
quantile  is  too  far  out  on  the  tail  to  be  estimated  accurately 
with  reasonable  sample  sizes.  Checking  Fig.  4.2  one  finds  that 
the  smallest  value  of  p  which  will  still  allow  one  to  differen¬ 
tiate  accurately  between  the  various  values  of  6  is  .995. 

Consequently,  p^  =  .995  was  chosen.  The  point  p2  was 
changed  from  .98  to  .975  so  that  the  two  points,  p^  and  p2, 
would  remain  an  adequate  distance  apart.  Choosing  p^  =  .995 
and  p2  *  .975,  one  can  write  t  for  the  Johnson  System  as 

sinh ( 2. 5* 6/ 6) 

t  *  _  (4.6) 

sinh{ 1. 960/ 6) 

This  equation  is  plotted  in  Fig.  4.9.  The  curve  is  similar  to 
the  one  in  Fig.  4.3. 

To  estimate  6  given  a  set  of  observations,  t  is  first 
estimated  from  the  two  quantiles 

the  99.5%  point 

T  -  -  (4.7) 

the  97.5%  point 

and  then  6  is  estimated  using  Eq.  (4.6).  To  estimate  the 
scale  parameter  A,  the  sample  variance  is  first  formed 
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(4.8) 
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where  x  is  the  sample  mean.  Then,  using  a2  and  the  estimate 
for  6,  an  estimate  for  X  may  be  found  from  Eq.  (4.1).  Using 
these  estimates  for  6  and  X  the  Johnson  System  density  which 
most  closely  approximates  the  noise  can  be  found.  The  optimum 
detector  for  this  density  should  be  close  to  optimum  for  the 
actual  noise. 

Fig.  4.10  displays  the  block  diagram  for  an  implementation 
of  a  Johnson  System  adaptive  detector.  The  detector  is  of  the 
same  form  as  that  shown  in  Fig.  4.1.  Twenty  samples  were  used 
to  make  each  decision.  After  every  10,000  samples,  t  and  the 
noise  variance  are  estimated  and  new  estimates  of  6  and  X 
are  formed.  The  appropriate  nonlinearity  from  the  Johnson 
System  is  then  selected  for  use  on  the  next  10,000  samples. 
Simulation  studies  have  been  performed  with  a  variety  of  noise 
densities  on  the  input.  The  results  from  these  studies  are 
presented  in  the  next  section. 

A  key  problem  with  the  adaptive  detector  of  Fig.  4.10  is 
that  as  the  input  changes  and  the  detector  adapts,  the 
probability  of  false  alarm  will  change  unless  the  threshold  is 
adjusted  properly.  To  eliminate  this  difficulty,  a  noisy 
reference  channel  was  added  (Fig.  4.11).  The  reference  channel 
is  a  stream  of  samples  with  the  same  distribution  function  as 
the  noise  on  the  actual  channel.  However,  the  two  channels  are 
statistically  independent.  A  detector,  identical  to  the  one  on 
the  actual  channel,  is  placed  on  the  reference  channel.  From 
its  output,  an  estimate  for  the  probability  of  false  alarm  (a) 
can  be  found.  The  threshold  can  then  be  adjusted,  in  both 
channels,  to  attempt  to  achieve  a  =  0.1.  Also,  an  estimate 


Fig.  4.10  -  An  Adaptive  Detector  Based  on  the  Johnson  System 


Fig.  4.11  -  An  Adaptive  Detector  Based  on  the  Johnson  System 
Noisy  Reference  Channel  Added. 


of  the  probability  of  detection  (8)  can  be  obtained  by 
counting  the  number  of  correct  decisions  made  when  the  signal  is 
present . 

Section  2:  Simulation  Results 

The  linear  detector,  various  nonlinear  optimum  detectors, 
and  the  adaptive  detector  which  was  described  in  Section  1  have 
all  been  simulated  on  a  digital  computer.  Various  noise 
backgrounds  were  considered.  All  detectors  utilized  a  noisy 
reference  channel  to  aid  in  the  adjustment  of  the  threshold. 

The  detectors  attempted  to  maintain  a  probability  of  false  alarm 
(a)  equal  to  0.1.  The  following  graphs  display  estimates  of 
the  probability  of  false  alarm  (a),  the  probability  of 
detection  (8),  and  the  total  probability  of  error  (PE)  for 
the  various  detectors  and  noise  situations.  The  threshold 
levels  (T)  are  also  reported.  Thirty  estimates  of  these  values 
are  displayed.  Each  estimate  represents  500  decisions  made  by 
the  detector,  20  samples  per  decision.  The  first  few  estimates 
were  taken  before  the  threshold  had  stabilized.  For  this 
reason,  when  averaging  these  numbers  the  first  ten  estimates 
should  be  ignored. 

Figs.  4.12  through  4.15  display  a,  8«  PE  and  T  for 
both  the  optimum  and  adaptive  detectors  operating  in  Gaussian 
noise.  In  this  case  the  optimum  detector  is  just  the  linear 
detector.  Due  to  the  statistical  fluctuations  of  the  input,  it 


is  hard  to  determine  precisely  the  performance  of  these 
detectors.  However,  it  appears  that  their  performances  are 
nearly  the  same. 

Figs.  4.16  through  4.19  display  a,  6,  PE,  and  T  for 
the  optimum,  linear,  and  adaptive  detectors  in  Laplace  or  double 
exponential  noise.  The  Laplace  distribution  has  heavier  tails 
than  the  Gaussian.  The  optimum  detector  is  the  amplifier 
limiter.  The  adaptive  detector  performs  nearly  as  well  as  the 
optimum  detector  (see  Fig.  4.17  or  4.18).  The  linear  detector 
is  about  10%  worse. 

When  Johnson  noise  is  on  the  input  of  the  adaptive 
detector,  one  would  expect  it  to  perform  especially  well.  When 
it  properly  estimates  the  parameters  6  and  X,  it  will  be 
using  the  actual  optimum  detector.  Figs.  4.20  through  4.23  show 
that  this  did  occur.  The  adaptive  detector's  performance  is 
nearly  identical  to  that  of  the  optimum  detector.  Again  note 
that  the  linear  detector  is  a  rather  poor  detector  for  this 
heavy  tailed  density. 

A  Gaussian-Gaussian  mixture  (presented  in  Chapter  II  - 
Section  3)  was  used  to  achieve  a  density  with  still  heavier 
tails.  One  Gaussian  distribution,  the  contaminant,  was  given  a 
variance  one  hundred  times  larger  than  that  of  the  background 
Gaussian  distribution.  The  contaminant  occurred  with 
probability  0.1.  The  resulting  density  was  normalized  to  have 
variance  unity.  Figs.  4.24  through  4.26  show  the  performances 
of  the  optimum,  linear  and  adaptive  detectors  for  this  heavy 
tailed  density.  Fig..  4.27  displays  the  thresholds.  One  can 
easily  see  that  the  linear  detector  does  not  perform  very  well. 
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Fig.  4.12  -  Probability  of  False  Alarm:  Gaussian  Noise 
Variance  1,  Signal  Strength  =0.5 
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Probability  of  Detection:  Gaussi 
Variance  1,  Signal  Strength  =0.5 


Noise 
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Fig.  4.15  -  Threshold  Levels:  Gaussian  Noise 
Variance  1,  Signal  Strength  =  0.5 


Fig.  4.16  -  Probability  of  False  Alarm:  Laplace  Noise 
Variance  1,  Signal  Strength  =0.5 
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Probability  of  Detection:  Laplace  Noise 
Variance  1,  Signal  Strength  =0.5 
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Total  Probability  of  Error:  Laplace  Noise 
Variance  1,  Signal  Strength  =0.5 
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Fig.  4.19  -  Threshold  Levels:  Laplace  Noise 
Variance  1,  Signal  Strength  =0.5 
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Probability  of  Detection:  Johnson  Noise 
6=1,  Variance  1,  Signal  Strength  =0.5 
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Fig.  4.22  -  Total  Probability  of  Error:  Johnson  Noise 
6=1,  Variance  1,  Signal  Strength  =0.5 
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Fig.  4.23 


Threshold  Levels:  Johnson  Noise 
i  =  1,  Variance  1,  Signal  Strength  =0.5 
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Fig.  4.24  -  Probability  of  False  Alarm:  Gaussian-Gaussian  Mixture 
Variance  1,  Signal  Strength  =0.5 
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Total  Probability  of  Error:  Gaussian-Gaussian  Mixture 
Variance  1,  Signal  Strength  =0.5 
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Probability  of  False  Alarm:  Gaussian-Gaussian  Mixture 
Variance  1,  Signal  Strength  =0.2 
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Fig.  4.29  -  Probability  of  Detection:  Gaussian-Gaussian  Mixture 
Variance  1,  Signal  Strength  =0.2 
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Total  Probability  of  Error:  Gaussian-Gaus 
Variance  1,  Signal  Strength  =0.2 
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Threshold  Levels:  Gaussian-Gaussian  Mixture 
Variance  1,  Signal  Strength  =0.2 
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Fig.  4.32  -  Simulation  Results: 


However,  both  the  optimum  detector  and  the  adaptive  detector 


perform  so  well  it  is  hard  to  differentiate  between  them.  For 
this  reason,  this  case  was  repeated  with  a  decreased  signal 
strength. 

Using  0.2  for  the  signal  rather  than  0.5,  the  performances 
for  these  detectors  are  more  readily  apparent.  Figs.  4.28 
through  4.31  display  a,  5,  P^.  and  T.  Note  that  the  scale 
on  the  vertical  axis  for  both  Figs.  4.29  and  4.30  have  been 
changed.  The  linear  detector's  performance  is  now  totally 
unacceptable.  However,  the  adaptive  detector's  performance  is 
close  to  that  of  the  optimum  detector,  even  with  this  relatively 
small  signal. 

Fig.  4.32  displays  the  average  value  of  o,  $  and  the 
total  probability  of  error.  In  computing  this  average  the  first 
ten  estimates  were  ignored  to  allow  the  threshold  to  stabilize. 
Due  to  the  fact  that  the  estimated  value  for  a  did  deviate 
from  0.1,  the  estimates  of  0  should  not  be  compared  blindly. 

The  column  for  total  probability  of  error  removes  this 
variability  and  is  probably  the  best  column  for  comparison 
purposes.  In  all  cases  the  adaptive  detector  is  found  to  be 
nearly  optimum.  For  the  heavier  tailed  densities  it  is  also 
found  to  perform  far  better  than  the  linear  detector. 


A  Verification  of  Some  Results  from  Chapter  III 


The  results  of  Chapter  III  -  Section  1  can  be  used  to  find 
asymptotic  values  (small  signal,  large  numbers  of  observations) 
for  the  total  probability  of  error  for  optimum  detectors.  The 


table  in  Fig.  4.33  displays  the  five  cases  which  were 
simulated.  Adding  the  value  of  0.1  for  a  to  1  -  6  from  Eq.  (3.13) 
yields  the  asymptotic  probability  of  error  for  the  optimum 
detector.  The  final  column  lists  the  probability  of  error 
observed  from  simulations.  These  numbers  are  in  close 
agreement.  This  verifies  both  Eq.  (3.13)  and  the  accuracy  of 
the  simulation  results.  Note  that  the  total  probability  of 
error  for  densities  with  high  values  for  Fisher's  Information  is 
significantly  less  than  for  densities  with  low  values  for 
Fisher's  Information.  This  supports  the  conclusion  from  Chapter 
III  that  Fisher's  Information  can  serve  as  a  measure  of  the 
difficulty  of  detecting  an  additive  signal  in  noise.  Figs.  4.34 
through  4.42  also  serve  to  illustrate  this  point. 

Fig.  4.34  displays  the  probability  of  detection  (B)  for 
optimum  detectors  in  Gaussian,  Laplace,  Johnson  (c  *  1),  and 
Gaussian-Gaussian  mixture  noise.  Fig.  4.35  displays  B  for  the 
adaptive  detector  and  Fig.  4.36  displays  B  for  the  linear 
detector.  For  both  adaptive  and  optimum  detectors,  densities 
with  high  values  for  Fisher's  Information  yield  higher  power 
than  densities  with  low  values  for  Fisher's  Information.  The 
linear  detector  behaves  about  the  same  for  all  cases.  One  can 
think  of  the  linear  detector  as  being  uniformly  poor  for  all 
non-Gaussian  densities. 

Figs.  4.37,  4.38  and  4.39  display  both  a  and  8  for  the 
optimum,  adaptive  and  linear  detectors.  These  curves  are 
included  to  show  all  of  the  data  plotted  to  the  same  scale. 

Figs.  4.40,  4.41  and  4.42  display  the  probability  of  error  for 
optimum,  adaptive,  and  linear  detectors.  The  same  observations 


-155- 


Noi  se 


Gaussian 


Laplace 


Johnson 
(  6=1) 


Gaussian- 

Gaussian 

Mixture 


Gaussian- 
Gaussi an 
Mixture 
( signal=0 


Fisher  '  s 
Information 


Calculated  Asymptotic* 
Probability  of  Error 


1 


.  270 


2 


.130 


2.386 


.  115 


* 

Observed 

Probability  of  Erro 


.  269 


.  160 


.  112 


9.  02 


.100 


.  102 


9.02 


.180 


.2) 


.  180 


*  Values  reported  to  3  significant  digits  only 


Fig.  4.33  -  Comparison  of  Computed  and  Observed 

Probability  of  Error  for  Optimum  Detectors 
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Fig.  4.37  -  a  and  6:  Optimum  Detectors 
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Fig.  4.39  -  a  and  6:  Linear  Detector 
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which  were  made  for  the  power  curves  can  be  made  here.  Highly 
non-Gaussian  noise,  with  high  values  for  Fisher's  Information, 
can  be  an  asset  since  the  increased  structure  of  the  noise  can 
be  utilized  to  increase  detector  performance  over  that  possible 
with  just  Gaussian  noise.  Nonlinear  processing,  however,  is 
required  to  achieve  this  improvement.  The  linear  detector's 
performance  remained  relatively  the  same  for  all  densities 
studied . 


The  Performance  of  the  Sign  Detector 


The  sign  detector  was  also  considered  in  this  simulation 
study.  The  sign  detector  or  hard  limiter  may  be  expressed  as 


N 


T(x)  *  l  sgn ( ) 
i  =  l 


where , 


sgn(x) 


I  +1 
0 

-1 


,  x>0 
,x*0 
,  x  <0 


Figs.  4.43  through  4.46  display  a,  6,  PE  and  the  threshold 
for  the  sign  detector  in  both  Gaussian  and  Laplace  noise.  As 
before,  N  =  20  samples  were  used.  In  both  cases,  although  the 
threshold  had  stabilized  near  6,  the  curves  for  a  and  6 
cycled  between  two  levels.  To  understand  this,  a  and  6  were 
calculated  numerically. 
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Fig.  4.45  -  Total  Probability  of  Error:  Sign  Detector 


15 


2D 


25 


30 


successive  estimates  (time) 


Laplace  Noise 
Gaussian  Noise 


Fig.  4.46  -  Threshold  Levels:  Sign  Detector 
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Under  H,  when  the  noise  has  a  symmetric  distribution, 
sgn(x^)  will  be  -1  with  probability  1/2  and  +1  with 
probability  1/2.  Considering  the  sum  of  observations  as 
binomial  trials,  the  probability  distribution  for  T(x)  can  be 
calculated.  It  consists  of  impulses  at  all  even  numbers  from 
-20  to  20  as  shown  in  Fig.  4.47.  Summing  these  impulses  it  can 
be  seen  that 


a  =  ProbH[T(x )  >  T]  =  0.132 

=  0.058 


for  4  $  T  <  6 
for  6  ^  T  <  8 


(4.10) 


The  oscillations  in  a  are  due  to  the  following  reasons.  Whenever 
4  ^  T  <6,  the  detector  finds  a  ~  0.132.  The  threshold  is 
then  raised  slightly  in  an  attempt  to  lower  a  to  0.1.  As  soon  as 
T  reaches  6  (  6  T  <  8)  ,  a  falls  to  approximately  0.058. 

Since  a  *  0.1  is  impossible  to  achieve,  the  system  continues  to 
oscillate  with  T  very  close  to  6.  A  corresponding  oscillation  in 
8  occurs. 

As  a  check,  8  for  Gaussian  noise  can  be  calculated  easily. 

For  a  signal  of  1/2,  sgn(x^)  equals  -1  with  probability  0.3085 
and  +1  with  probability  0.6915.  The  distribution  of  T(x)  is  given 
by  Fig.  4.48.  For  4  ,$  T  <  6,  8  =  .746.  When  6  .$  T  <8, 

8  =  .575.  These  numbers  are  in  close  agreement  with  those 
actually  observed  (Fig.  4.44). 

These  oscillations  are  a  serious  drawback  for  the  sign 
detector.  Probably  the  best  way  to  eliminate  this  problem  is  to 
utilize  a  randomized  test.  Such  tests  have  not  been  considered 
here  since  the  sign  detector  is  the  only  detector  mentioned  above 
which  yields  outputs  which  are  discrete  random  variables. 
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4.47  -  Probability  Distribution  of  the  Output  of  the 
Sign  Detector:  Symmetric  Noise  Density, 

No  Signal 
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Fig.  4.48  -  Probability  Distribution  of  the  Output  of  the 
Sign  Detector:  Gaussian  Noise,  Signal  =  1/2 
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CONCLUSION 


In  this  dissertation,  methods  for  the  detection  of  known, 
discrete-time  signals  in  impulsive  noise  have  been  investi¬ 
gated.  After  a  brief  introduction  to  the  detection  problem  in 
Chapter  I,  Chapter  II  concentrated  on  impulsive  noise.  The 
characteristics  and  sources  of  impulsive  noise  were  considered 
briefly.  Several  impulsive  noise  models  were  explained.  Three 
systems  of  density  functions  were  then  presented  to  provide 
first  order  non-Gaussian  noise  models  which  are  simple  enough 
to  use  in  detection  problems.  These  three  systems  are:  a 
generalized  Gaussian  noise,  the  Johnson  Su  System,  and  a 
mixture  model.  Further  research  is  needed  in  developing 
impulsive  noise  models  which  are  tractable  in  detection 
problems.  Middleton's  model  Cl]  is  probably  the  most  complete 
and  comprehensive  to  date.  However,  it  is  quite  complex  and  is 
unable  to  model  the  dependency  which  has  been  observed  for 
impulsive  noise  samples.  Research  concentrating  on  modeling 
this  dependency  structure  and  on  developing  higher  order 
densities  for  non-Gaussian  noise  is  needed. 

In  Chapter  III,  the  importance  of  Fisher's  Information  in 
assessing  the  asymptotic  performance  of  detectors  has  been 
stressed.  It  has  been  shown  that  an  impulsive  noise  with  a 


high  value  for  Fisher’s  Information  provides  a  better 
environment  for  detection  than  does  a  Gaussian  noise  of  equal 
power.  The  increased  structure  of  impulsive  noise  may  be 
exploited  to  increase  detector  performance  over  that  possible 
for  Gaussian  noise.  Nonlinear  processing,  however,  is  required 
to  achieve  this  improvement.  To  further  illuminate  the 
relationship  between  Fisher's  Information  and  detection,  it  may 
be  beneficial  to  explore  the  connection  between  Fisher's 
Information  and  entropy.  To  illustrate  this  connection, 
observe  that  among  all  densities  of  a  given  variance,  the 
Gaussian  distribution  both  minimizes  Fisher's  Information  and 
maximizes  entropy.  Further  research  centered  on  exploring  this 
connection  and  on  the  effect  of  these  measures  on  the  analysis 
of  detector  performance  would  be  desirable. 

The  relationship  between  Fisher's  Information  and 
asymptotic  detector  performance  was  utilized  in  Chapter  III  for 
the  design  of  minimax  detectors.  The  density  with  minimum 
Fisher's  Information  over  a  given  class  of  densities  is  the 
worst  case  density  within  the  class  for  detection.  The  optimum 
detector  for  this  density  is  the  minimax  detector  for  the 
class.  Several  density  classes  were  defined  and  conditions  for 
the  minimax  detectors  were  explored.  One  especially 
interesting  class  is  the  class  of  all  distributions  whose 
cumulative  distribution  functions  pass  through  k  given  points. 
Huber's  conditions  [2]  which  define  the  density  that  minimizes 
Fisher's  Information  over  this  class  were  presented.  This 
technique  appears  especially  promising  for  detection  purposes. 

A  possible  implementation  would  require  taking  data  points  and 


finding  the  density  with  minimum  Fisher's  Information  which 
passes  through  the  points.  Then,  the  optimum  detector  for  this 
density  can  be  used.  This  detection  procedure  should  be  quite 
robust . 

Minimax  techniques  can  be  useful  if  the  class  of  densities 
is  properly  defined.  However,  in  many  cases  it  may  be  more 
appropriate  to  find  a  detector  which  works  well  for  many 
non-Gaussian  densities  rather  than  to  find  the  one  that  works 
best  for  the  worst  case  density.  To  approach  this  problem  one 
could  examine  the  equation  for  efficacy  [Eq.  (3.39)]  for 
various  detectors  and  density  systems,  the  objective  being  to 
find  a  nonlinearity  which  yields  high  values  for  efficacy  for 
many  different  densities.  This  nonlinearity  should  then  be 
tested  by  simulation  to  determine  how  well  the  performance 
predictions  based  on  efficacy  carry  over  to  the  small  sample 
case . 

In  Chapter  IV  an  adaptive  detector  based  on  the  Johnson 
System  was  presented.  Simulation  results  were  displayed  which 
showed  that  this  detector  performs  well  for  several  different 
densities.  However,  to  truly  test  this  system,  actual 
impulsive  noise  data  should  be  used.  A  variation  of  this 
detector  could  also  be  considered.  The  tail  measure  p, 
rather  than  t,  could  be  used  to  estimate  the  parameter  6. 

Use  of  p  would  require  taking  only  one  quantile.  Since  this 
point  is  on  the  tail  of  the  density,  fewer  samples  would  have 
to  be  saved  to  find  an  estimate  for  the  quantile.  That  is, 
when  looking  for  the  .995  quantile  only  the  top  50  out  of 


10,000  points  need  be  saved;  while  looking  for  the  .975 
quantile  (required  for  estimating  t)  requires  saving  250 
points.  Fast  methods  to  estimate  both  the  quantiles  and  the 
rate  of  change  of  the  quantiles  [dR(p)/dp  where  R(p)  =  F  ^(p)] 
are  needed. 

As  briefly  mentioned  in  Chapter  IV,  the  tail  measures 
32,  t,  and  p  can  be  used  as  tools  for  the  comparison  of 
density  systems.  Different  densities,  whose  tail  behavior  are 
identical  as  measured  by  either  t  or  p,  could  be  compared 

in  terms  of  Fisher's  Information  or  the  performance  of  various 
detectors.  This  study  could  aid  in  the  selection  of  appropriate 
density  systems  for  noise  modeling. 
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